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100 bp 



F94L 




nt419(del7-inslO) 


ATTGATCAGTTCGATGTC 




TGCTTCTTTAAATT.- TAGC 


I D Q F D V 




C F F K F S 


ATTGATCAGTTAGATGTC 


TGCTTCTAAGCATACAATAGC 


I D Q L O V 




CP* 



nt374 -51(T- C> 
GTTTGTTTTTCAT 
GTTCATTTTTCAT 
nt374-50(G-A) 





nt748-78<dell) 
OATTAG' 
GATTAGTTT„CTTTCTTT 



Q204X 




nt821(delll) 


GOTATTTGGCAGAGCATT 




TGTGATGAACACTCCACACA 


O I W Q S I 




C 0 E H S T E 


OGTATTTCGTAGAGCATT 




TGTG ACAGA 


0 I W * 




C R KID* 



nt374-16(d«ll) 
TTTTTTTCTTATTCA 
TTTTTT,,,CT7ATTCA 



nt414<C-T) 
CCCAAATGTTGCTTCTTT 
P X C C F F 
CCCAAATGTTGTTTCTTT 
P K C C P P 



E32SX 
TTACGCATTGAAATCAAA 
L 0 I E I K 
TTAGGCATTTAAATCAAA 
L G I * 



C313Y 
TCTGGAQAATGTGAATTT 
S O E C E P 
TCTGGAOAATATGAATTT 
S G E YEP 



(57) Abstract 



A gene (cDNA) encoding a bovine myostatin protein. The nucleic acid coding sequence is identified as SEQ ID NO: 1 and the protein 
sequence is identified as SEQ ID NO:2. A mutant gene (SEQ ID NO:3) in which the coding sequence lacks an 1 1-base pair consecutive 
sequence (SEQ ID NO: 11) of the sequence encoding bovine protein having myostatin activity has been sequenced. It has been shown that 
cattle of the Belgian Blue breed homozygous for the mutant gene lacking myostatin activity are double-muscled. A method for determining 
the presence of muscular hyperplasia in a mammal is described. The method includes obtaining a sample of material containing DNA from 
the mammal and ascertaining whether a sequence of the DNA encoding (a) a protein having biological activity of myostatin, is present, and 
whether a sequence of the DNA encoding (b) an allelic protein lacking the activity of (a), is present. The absence of (a) and the presence 
of (b) indicates the presence of muscular hyperplasia in the mammal. 
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MUTATIONS IN THE MYOSTATIN GENE CAUSE DOUBLE-MUSCLING IN MAMMALS 
Field of The Invention 

This invention relates to factors affecting muscle development in mammals, 
especially livestock. In particular, this invention relates to the cloning of the myostatin gene, a 
5 member of the TGF-p superfamily, its involvement in muscular hyperplasia in livestock, and a 
method for determining myostatin genotypes. 

Description Of Related Art 

The TGF-p superfamily consists of a group of multifunctional polypeptides which 
control a wide range of differentiation processes in many mammalian cell types. GDF-8 is a 

1 0 member of the TGF-p superfamily. All members of this superfamily share a common structure 
including a short peptide signal for secretion and an N-terminal peptide fragment that is 
separated from the bioactive carboxy-terminal fragment by proteolytic cleavage at a highly 
conserved proteolytic cleavage site. The bioactive carboxy-terminal domain is characterized by 
cysteine residues at highly conserved positions which are involved in intra- and intermolecular 

15 disulfide bridges. The functional molecules are covalently linked (via a S-S bond) dinners of the 
carboxy-termina! domain (Masterson et a/., 1 996). 

Recently, it was reported that mice deficient in the gene coding for GDF-8 were 
characterized by a generalized muscular hyperplasia (McPherron etal., 1997). The GDF-8 
deficient mice were produced by gene targeting using homologous recombination in embryonic 

20 stem cells, a method referred to as "gene knock-out". The murine generalized muscular 
hyperplasia appeared to be very similar in its expression to the muscular hyperplasia 
characterizing "double-muscled" cattle. This observation raised the intriguing possibility that the 
bovine gene coding for GDF-8 (i.e. the bovine evolutionary homologue of the mouse GDF-8 
gene) is involved in the bovine double-muscling phenotype. It also raised the possibility that the 

25 human genes coding for GDF^8 (i.e. the human evolutionary homologue of the mouse GDF-8 
gene) is involved in regulating muscular development in humans, specifically skeletal muscle 
genesis. Isolation of the human GDF-8 gene may have therapeutic uses/applications in the 
treatment of muscuiodegenerative diseases through upgrading or downgrading the expression of 
GDF-8. 

30 The occurrence of animals characterized by a distinct generalized muscular 

hypertrophy, commonly known as "double-muscled" animals, has been reported in several cattle 
breeds around the world. The first documented description of double-muscled cattle dates back 
as early as 1807 (Culley, 1807). One of the breeds in which this characteristic has been most 
thoroughly analyzed is the Belgian Blue Cattle Breed ("Belgian Blue Breed"). This is one of the 

35 only breeds where the double-muscled trait has been systematically selected for, and where the 
double-muscied phenotype is virtually fixed. A comparison of double-muscled and conventional 
animals within the Belgian Blue Breed, showed an increase in muscle mass by 20% on average, 
while all other organs were reduced in size (Hanset, 1986 and 1991). The muscular hypertrophy 
was shown to be an histological hyperplasia affecting primarily superficial muscles, 

40 accompanied by a 50% reduction in total lipid content and a reduction in connective tissue 
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fraction as measured by hydroxyproline content (Hanset et a/., 1982). Double-muscled animals 
were shown to have a reduced feed intake with improved feed conversion ratio (Hanset et a/., 
1987). An important economic benefit of double-muscled animals, in contrast to conventional 
animals, is the substantial increase in selling price and net income for the farmer (Hanset et a/., 
5 1987). 

One of the most thorough series of studies on double-muscling is that of Hanset 
and colleagues in the Belgian Blue Breed. Objective criteria of muscular development, such as 
dressing-out percentage, lean and fat percentage, plasma and red cell creatine and creatinine 
concentrations, were measured on nearly 150 randomly selected animals raised in standardized 
10 conditions. These studies clearly revealed abnormal, birnodal distributions of the double- 
muscled phenotype and objectively confirmed the visual classification traditionally performed by 
breeders on double-muscled arid conventional animals. The phenotypic distribution was 
resolved using a maximum likelihood procedure into two component normal populations with a 
common variance which revealed mean differences of three to four standard deviations 
15 depending on the trait. This suggested the presence of an allele having a major effect on 

muscular development with a population frequency close to 50% (Hanset and Michaux, 1985b). 
The most convincing evidence in favour of such an allele, however, came from experimental 
crosses involving double-muscled Belgian Blue sires and Holstein Friesian dairy cows (the latter 
animals having very poor muscular development). While F1 offspring showed a phenotypic 
20 distribution very similar to their Holstein Friesian dams, backcrossing these F1 's to double- 
muscled sires produced a birnodal BC generation, clearly pointing towards the Mendelian 
segregation of a recessive "mh" (muscular hypertrophy) allele (Hanset and Michaux., 1985a). 

The same kind of experimental crosses were subsequently used to perform a 
whole genome scan using a microsatellite based marker map. To perform the linkage analysis, 
25 animals were classified as double-muscled or conventional. Very significant Logarithm of the 
Odds scores (lodscores) were obtained on chromosome 2 (> 17), and multi point linkage 
analysis positioned the mh locus at the centromeric end of this chromosome, at [2]centimorgan 
from the nearest microsatellite marker: TGLA44. The corresponding chromosomal region 
accounted for all the variance of the trait assumed to be fully penetrant in this experiment 
30 (Chariieref a/., 1995). 

In humans, genes coding for some forms of muscular abnormalities have been 
isolated, e.g. muscular dystrophy. The present invention provides for the gene which regulates 
the development of skeletal muscle only, as opposed to other types of muscle, e.g. smooth or 
cardiac muscle. The present invention may provide an understanding of the role of the GDF-8 
35 gene or its receptor in the regrowth of skeletal muscle in humans which only undergo a 
hyperplasic response. 
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Summary of the Invention 

The present inventors have identified and sequenced a gene (cDNA and 
genomic) encoding a bovine myostatin protein. The nucleic acid coding sequence is identified as 
SEQ ID NO:1 and the protein sequence is identified as SEQ ID NO:2. The genomic bovine 
5 sequence is identified as SEQ ID NO:54. A mutant gene (SEQ ID NO:3) in which the coding 
sequence lacks an 1 1 -base pair consecutive sequence (SEQ ID NO:1 1) of the sequence 
encoding bovine protein having myostatin activity has been sequenced. It has been shown that 
cattle of the Belgian Blue breed homozygous for the mutant gene lacking myostatin activity are 
double-muscled. Other bovine mutations which lead to double-muscling in have also been 
10 determined, being identified herein as nt419(del7-ins10), Q204X, E226X and C313Y, 
respectively. 

In one aspect, the present invention thus provides a method for determining the 
presence of muscular hyperplasia in a mammal. The method includes obtaining a sample of 
material containing DNA from the mammal and ascertaining whether a sequence of the DNA 
15 encoding (a) a protein having biological activity of myostatin, is present, and whether a sequence 
of the DNA encoding (b) an allelic protein lacking the activity of (a), is present. The absence of 
(a) and the presence of (b) indicates the presence of muscular hyperplasia in the mammal. 

Of course, the mutation responsible for the lack of activity can be a naturally 
occurring mutation, as is the case for the Belgian Blue, Asturiana, Parthenaise or Rubia Gallega 
20 breeds, shown here. 

The mammal can be a human, bovine, etc. 

There are several methods known for determining whether a particular 
nucleotide sequence is present in a sample. A common method is the polymerase chain 
reaction. A preferred aspect of the invention thus includes a step in which ascertaining whether a 

25 sequence of the DNA encoding (a) is present, and whether a sequence of the DNA encoding (b) 
is present includes amplifying the DNA in the presence of primers based on a nucleotide 
sequence encoding a protein having biological activity of myostatin. 

A primer of the present invention, used in PCR for example, is a nucleic acid 
molecule sufficiently complementary to the sequence on which it is based and of sufficient length 

30 to selectively hybridize to the corresponding portion of a nucleic acid molecule intended to be 
amplified and to prime synthesis thereof under in vitro conditions commonly used in PCR. 
Likewise, a probe of the present invention, is a molecule, for example a nucleic acid molecule of 
sufficient length and sufficiently complementary to the nucleic acid molecule of interest, which 
selectively binds under high or low stringency conditions with the nucleic acid sequence of 

35 interest for detection thereof in the presence of nucleic acid molecules having differing 
sequences. 

In preferred aspects, primers are based on the sequence identified as SEQ ID 
NO:7 (human cDNA sequence) or SEQ ID NO:54. 
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In another aspect, the invention is a method for determining the presence of 
muscular hyperplasia in a mammal which includes obtaining a sample of material containing 
mRNA from the mammal. Such method includes ascertaining whether a sequence of the 
mRNA encoding (A) a protein having biological activity of myostatin, is present, and whether a 
5 sequence of the mRNA encoding (B) a protein at least partially encoded by a truncated 

nucleotide sequence corresponding to substantially the sequence of the mRNA and lacking the 
activity of (A), is present The absence of (A) and the presence of (B) indicates the presence of 
muscular hyperplasia in the mammal. 

The mRNA encoding (A) and the truncated sequence can correspond to alleles 
10 of DNA of the mammal. 

Again, if an amplification method such as PGR is used in ascertaining whether a 
sequence of the mRNA encoding (A) is present, and whether a sequence of the mRNA encoding 
(B) is present, the method includes amplifying the mRNA in the presence of a pair of primers 
complementary to a nucleotide sequence encoding a protein having biological activity of 
15 myostatin. Each such primer can contain a nucleotide sequence substantially complementary, 
for example, to the sequence identified as SEQ ID NO:7. The truncated sequence can contain at 
least 50 consecutive nucleotides substantially corresponding to 50 consecutive nucleotides of 
SEQ ID NO:7 t for example. 

in another aspect, the invention is a method for determining the presence of 
20 muscular hyperplasia in a mammal which includes obtaining a tissue sample of containing 
mRNA of the mammal and ascertaining whether an mRNA encoding a mutant type myostatin 
protein lacking biological activity of myostatin is present. The presence of such an mRNA 
encoding a mutant type myostatin protein indicates the presence of muscular hyperplasia in the 
mammal. 

25 In another aspect, the invention thus provides a method for determining the 

presence of muscular hyperplasia in a bovine animal. The method includes obtaining a sample 
of material containing DNA from the animal and ascertaining whether DNA having a nucleotide 
sequence encoding a protein having biological activity of myostatin is present. The absence of 
DNA having such a nucleotide sequence indicates the presence of muscular hyperplasia in the 

30 animal. Ascertaining whether DNA having a nucleotide sequence encoding a protein having 
biological activity of myostatin can include amplifying the DNA in the presence of primers based 
on a nucleotide sequence encoding a protein having biological activity of myostatin. 

In particular, the method can be carried out using a sample from an animal in 
which such a bovine animal not displaying muscular hyperplasia is known to have a nucleotide 

35 sequence which is capable of hybridizing with a nucleic acid molecule having the sequence 
identified as SEQ ID NO:1 under stringent hybridization conditions. 

It is possible that ascertaining whether DNA having a nucleotide sequence 
encoding a protein having biological activity of myostatin is present includes amplifying the DNA 
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in the presence of primers based on a nucleotide sequence encoding the N-terminal and the C- 
terminal, respectively, of the protein having biological activity of myostatin. 

Primers, say first and second primers, can be based on first and second 
nucleotide sequences encoding spaced apart regions of the protein, wherein the regions flank a 
5 mutation known to naturally occur and which when present in both alleles of a such an animal 
results in muscular hyperplasia. 

It can also be that DNA of such an animal not displaying muscular hyperplasia 
contains a nucleotide sequence which hybridizes under stringent conditions with a nucleotide 
sequence encoding a protein having a sequence identified as SEQ ID NO:2 and the coding 

10 sequence of DNA of a such an animal displaying muscular hyperplasia is known to contain an 
1 1 -base pair deletion beginning at base pair no. 821 of the coding sequence, and said first primer 
is selected to be upstream of the codon encoding glutamic acid no. 275 and the second primer 
is selected to be downstream of the codon encoding aspartic acid no. 274. 

Also, a DNA of such an animal not displaying muscular hyperplasia might 

15 contain a nucleotide sequence which hybridizes under stringent conditions with a nucleotide 
sequence encoding a protein having a sequence identified as SEQ ID NO:2. The coding 
sequence of DNA of such an animal displaying muscular hyperplasia might be known to contain 
an 1 1-base pair deletion beginning at base pair no. 821 . A primer can be selected to span the 
nucleotide sequence including base pair nos. 820 and 821 of the DNA sequence containing the 

20 deletion. 

The animal can be of the Belgian Blue breed. 

In a particular aspect, ascertaining whether DNA having a nucleotide sequence 
encoding a protein having biological activity of myostatin is present includes amplifying the DNA 
in the presence of a primer containing at least a portion of a mutation known to naturally occur 
25 and which when present in both alleles of a said animal results in muscular hyperplasia. 

In another aspect, the invention is a method for determining the presence of 
muscular hyperplasia in a bovine animal which includes obtaining a sample of the animal 
containing mRNA and ascertaining whether an mRNA encoding a protein having biological 
activity of myostatin is present in the sample. The absence of said mRNA indicates the presence 
30 of muscular hyperplasia in the animal. 

A sample containing mRNA can be muscle tissue, particularly, skeletal muscle 

tissue. 

In a particular aspect, the invention is a method for determining the presence of 
double muscling in a bovine animal, involving obtaining a sample of material containing DNA 
35 from the animal and ascertaining whether the DNA contains the nucleotide sequence identified 
as SEQ ID NO:11 in which the absence of the sequence indicates double muscling in the animal. 

In a particular aspect, the animal is of the Belgian Blue breed. 

In another aspect, the invention is a method for determining the myostatin 
genotype of a mammal, as may be desirable to know for breeding purposes. The method 
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includes obtaining a sample of material containing nucleic acid of the mammal, wherein the 
nucleic acid is uncontaminated by heterologous nucleic acid; ascertaining whether the sample 
contains a (i) nucleic acid molecule encoding a protein having biological activity of myostatin; and 
ascertaining whether the sample contains an (ii) allelic nucleic acid molecule encoding a protein 
5 lacking biological activity of myostatin. The mammal can be bovine. 

In another aspect, the subject is human and (i) includes a nucleic acid sequence 
substantially homologous (in the sense of identity) with the sequence identified as SEQ ID NO:7. 

The invention includes a method of increasing muscle mass of a mammal 
having muscle cells in which myostatin is expressed, the method comprising administering to the 
1 0 mammal an effective amount of a nucleic acid molecule substantially complementary to at least 
a portion of mRNA encoding the myostatin and being of sufficient length to sufficiently reduce 
expression of the myostatin to increase the muscle mass. In a particular aspect, the mammal is 
bovine. 

In another embodiment, the invention is a method of increasing muscle mass of 
15 a mammal, including administering to the mammal an effective amount of a nucleic acid 

molecule having ribozyme activity and a nucleotide sequence substantially complementary to at 
least a portion of mRNA encoding myostatin and being of sufficient length to bind selectively 
thereto to sufficiently reduce expression of the myostatin so as to increase the muscle mass. 

The invention includes a diagnostic kit, for determining the presence of muscular 
20 hyperplasia in a mammal from which a sample containing DNA of the mammal has been 
obtained. The kit includes first and second primers for amplifying the DNA, the primers being 
complementary to nucleotide sequences of the DNA upstream and down stream, respectively, of 
a mutation in the portion of the DNA encoding myostatin which results in muscular hyperplasia of 
the mammal, wherein at least one of the nucleotide sequences is selected to be from a non- 
25 coding region of the myostatin gene. The kit can also includes a third primer complementary to a 
naturally occurring mutation of a coding portion of the myostatin gene. 

A particular diagnostic kit, for determining the genotype of a sample of 
mammalian genetic material, particularly bovine material includes a pair of primers for amplifying 
a portion of the genetic material corresponding to a nucleotide sequence which encodes at least 
30 a portion of a myostatin protein, wherein a first of the primers includes a nucleotide sequence 
sufficiently complementary to a mutation of SEQ ID NO:1 to prime amplification of a nucleic acid 
molecule containing the mutation, the mutation being selected from the group of mutations 
resulting from: (a) deletion of 1 1 nucleotides beginning at nucleotide 821 of the coding portion of 
SEQ ID NO:1; (b) deletion of 7 nucleotides beginning at nucleotide 419 of the coding sequence 
35 and insertion of the sequence AAGCATACAA in place thereof; (c) deletion of nucleotide 204 of 
the coding sequence and insertion of T in place thereof; (d) deletion of nucleotide 226 of the 
coding sequence and insertion of T in place thereof; and (e) deletion of nucleotide 313 of the 
coding sequence and insertion of A in place thereof; and combinations thereof. The second of 
the pair of primers is preferably located entirely upstream or entirely downstream of the selected 
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mutation or mutations, tn one kit, a first said primer spans mutation (a) and further comprising a 
third primer which is sufficiently complementary to the nucleotide sequence identified as SEQ ID 
NO:1 1 to prime amplification of a nucleic acid molecule containing SEQ ID NO:11 . In another (or 
the same kit), a first said primer is sufficiently complementary to the inserted sequence of 
5 mutation (b) to prime amplification of a nucleic acid molecule containing mutation (b) and further 
comprising a third primer which is sufficiently complementary to the sequence corresponding to 
the 7 nucleotide deletion of mutation (b) to prime amplification of a nucleic acid molecule 
containing the 7 nucleotide deletion of mutation (b). In another (or the same kit), a first said 
primer spans mutation (c) and further comprising a third primer which is sufficiently 
10 complementary to the sequence spanning the corresponding region lacking mutation (c) to prime 
amplification of a nucleic acid molecule lacking mutation (c). In another (or the same kit), a first 
said primer spans mutation (d) and further comprising a third primer which is sufficiently 
complementary to the sequence spanning the corresponding region lacking mutation (d) to 
prime amplification of a nucleic acid molecule lacking mutation (d). In another (or the same kit), 
15 a first said primer spans mutation (e) and further comprising a third primer which is sufficiently 
complementary to the sequence spanning the corresponding region lacking mutation (e) to 
prime amplification of a nucleic acid molecule lacking mutation (e). 

The invention includes a purified protein having biological activity of myostatin, 
and having an amino acid sequence identified as SEQ ID NO:2, or a conservatively substituted 
20 variant thereof. The invention includes a purified bovine protein having biological activity of 
myostatin or a purified human protein (SEQ ID NO:8) having biological activity of myostatin. 

The invention includes an isolated nucleic acid molecule encoding a foregoing 
protein. Particularly, the invention includes an isolated nucleic acid molecule comprising a DNA 
molecule having the nucleotide sequence identified as SEQ ID NO:1 or SEQ ID NO:3 or SEQ ID 
25 NO:7 or which varies from the sequence due to the degeneracy of the genetic code, or a nucleic 
acid strand capable of hybridizing with at least one said nucleic acid molecule under stringent 
hybridization conditions. 

The invention includes isolated mRNA transcribed from DNA having a sequence 
which corresponds to a nucleic acid molecule of the invention. 
30 The invention includes isolated DNA in a recombinant cloning vector and a 

microbial cell containing and expressing heterologous DNA of the invention. 

The invention includes a transfected cell line which expresses a protein of the 

invention. 

The invention includes a process for producing a protein of the invention, 
35 including preparing a DNA fragment including a nucleotide sequence which encodes the protein; 
incorporating the DNA fragment into an expression vector to obtain a recombinant DNA molecule 
which includes the DNA fragment and is capable of undergoing replication; 
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transforming a host cell with the recombinant DNA molecule to produce a transformant which 
can express the protein; culturing the transformant to produce the protein; and recovering the 
protein from resulting cultured mixture. 

The invention includes a method of inhibiting myostatin so as to induce 
5 increased muscle mass in a mammal, comprising administering an effective amount of an 
antibody to myostatin to the mammal. 

The invention includes a method of increasing muscle mass in a mammal, by 
raising an autoantibody to the myostatin the in the mammal. Raising the autoantibody can 
include administering a protein having myostatin activity to the mammal. 

10 The invention includes a method of increasing muscle mass in a mammal 

including administering to the mammal an effective amount of an antisense nucleic acid or 
oligonucleotide substantially complementary to at least a portion of the sequence identified as 
SEQ ID NO:1 or SEQ ID NO:5, or SEQ ID NO:7. The portion can be at least 5 nucleotide bases 
in length or longer. The mammal can be a bovine and the sequence can be that identified as 

15 SEQlDNO:1. 

The invention includes a method of inhibiting production of myostatin in a 
mammal in need thereof, including administering to the mammal an effective amount of an 
antibody to the myostatin. 

The invention includes a probe containing a nucleic acid molecule sufficiently 

20 complementary with a sequence identified as SEQ ID NO:1 , or its complement, so as to bind 
thereto under stringent conditions. The probe can be a sequence which is between about 8 and 
about 1 195 nucleotides in length. 

The invention includes a primer composition useful for detection of the presence 
of DNA encoding myostatin in cattle. The composition can include a nucleic acid primer 

25 substantially complementary to a nucleic acid sequence encoding a bovine myostatin. The 
nucleic acid sequence can be that identified as SEQ ID NO:1. 

The invention includes a method for identifying a nucleotide sequence of a 
mutant gene encoding a myostatin protein of a mammal displaying muscular hyperplasia. The 
method includes obtaining a sample of material containing DNA from the mammal and probing 

30 the sample using a nucleic acid probe based on a nucleotide sequence of a known gene 

encoding myostatin in order to identify nucleotide sequence of the mutant gene. In a particular 
approach, the probe is based on a nucleotide sequence identified as SEQ ID NO:1 , SEQ ID NO:5 
or SEQ ID NO:7. Preferably, the probe is at least 8 nucleic acids in length. The step of probing 
the sample can include exposing the DNA to the probe under hybridizing conditions and further 

35 comprising isolating hybridized nucleic acid molecules. The method can further include the step 
of sequencing isolated DNA. The method can include the step of isolating and sequencing a 
cDNA or mRNA encoding the complete mutant myostatin protein. The method can include a 
step of isolating and sequencing a functional wild type myostatin from the mammal not displaying 
muscular hyperplasia. 
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The method can include comparing the complete coding sequence of the 
complete mutant myostatin protein with, if the coding sequence for a functional wild type 
myostatin from such a mammal is previously known, (1) the known sequence, or if the coding 
sequence for a functional wild type myostatin from such a mammal is previously unknown, (2) 
5 the sequence determined according to claim 63 or claim 66, to determine the location of any 
mutation in the mutant gene. 

The invention includes a primer composition useful for the detection of a 
nucleotide sequence encoding a myostatin containing a first nucleic acid molecule based on a 
nucleotide sequence located upstream of a mutation determined according to a method of the 
10 invention and a second nucleic acid molecule based on a nucleotide sequence located 
downstream of the mutation. 

A probe of the invention can include a nucleic acid molecule based on a 
nucleotide sequence spanning a mutation determined according to the invention. 

The invention includes an antibody to a protein encoded by a nucleotide 
1 5 sequence identified as SEQ ID NO:1 , SEQ ID NO:3 or SEQ ID NO:7, or other protein of the 
present invention. 

The invention includes a transgenic bovine having a genome lacking a gene 
encoding a protein having biological activity of myostatin; a transgenic mouse having a genome 
containing a gene encoding a human protein having biological activity of myostatin or containing 

20 a gene encoding a bovine protein having biological activity of myostatin; a transgenic bovine 

having a gene encoding a bovine protein having biological activity of myostatin and heterologous 
nucleotide sequence antisense to the gene. The transgenic bovine can include a gene encoding 
a nucleic acid sequence having ribozyme activity and in transcriptional association with the 
nucleotide sequence antisense to the gene. 

25 The invention includes a transgenic mammal, usually non-human, having a 

phenotype characterized by muscular hyperplasia, said phenotype being conferred by a 
transgene contained in the somatic and germ cells of the mammal, the transgene encoding a 
myostatin protein having a dominant negative mutation. The transgenic mammal can be male 
and the transgene can be located on the Y chromosome. The mammal can be bovine and the 

30 transgene can be located to be under the control of a promoter which normally a promoter of a 
myosin gene. 

Another transgenic mammal, usually non-human, of the invention has a 
phenotype characterized by muscular hyperplasia, in which the phenotype is conferred by a 
transgene having a sequence antisense to that encoding a myostatin protein of the mammal. 
35 The mammal can be a male bovine and the transgene can be located on the Y chromosome. 
The transgene can further include a sequence which when transcribed obtains an mRNA having 
ribozyme activity. 

A transgenic non-human mammal of the invention having a phenotype 
characterized by muscular hyperplasia, can have the phenotype inducible and conferred by a 
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myostatin gene flanked by J oxP sides and a Cre transgene under the dependence of an 
inducible promoter. 

A transgenic non-hurnan male mammal of the invention having a phenotype 
characterized by muscular hyperplasia, can have the phenotype conferred by a myostatin gene 
5 flanked by J oxP sides and a Cre transgene located on the Y chromosome. 

The invention includes a method for determining whether a sample of 
mammalian genetic material is capable of a conferring a phenotype characterized by muscular 
hyperplasia, comprising ascertaining whether the genetic material contains a nucleotide 
sequence encoding a protein having biological activity of myostatin, wherein the absence of said 
10 sequence indicates the presence of muscular hyperplasia in the animal. 

Brief Description Of Drawings 

In describing particular aspects of the invention, reference is made to the 

accompanying drawings, in which: 

Figure 1 is a schematic summary of genetic, physical and comparative mapping 
1 5 information around the bovine mh locus. A multi-point lodscore curve obtained for the mh locus 

with respect to the microsateilite marker map is shown. Markers that were not informative in the 

pedigree used are shown between brackets; their map position is inferred from published 

mapping data. Markers and the YACs from which they were isolated are connected by arrows. 

The RH-map of the relevant section of human HSA2 is shown, with the relative position in cR of 
20 the ESTs used. Stippled lines connect microsateilite and Type I markers with their respective 

positive YACs. YACs showing cross-hybridizing SINE-PCR products are connected by the red 

boxes. 

Figure 2(a) shows electropherograms obtained by cycle-sequencing the 

myostatin cDNA sequence from a double-muscled and a conventional animal, showing the 
25 nt821del(11) deletion (SEQ ID NO:1 1) in the double-muscled animal. The primers used to 

amplify the fragment encompassing the deletion from genomic DNA are spaced apart from the 

remaining nucleotides. 

Figure 2(b) shows the amino-acid sequence of the murine (top row), bovine 

normal (middle row) and bovine nt821del(11) (bottom row) allele. The putative site of proteolytic 
30 processing is boxed, while the nine conserved cysteines in the carboxy-terminai region are 

underlined. The differences between the normal and nt821del(11) bovine allele are indicated by 

the double underlining. 

Figure 3 is a schematic representation of the bovine myostatin gene with position 

and definition of the identified DNA sequence polymorphisms. The "A" (clear) boxes correspond 
35 to the untranslated leader and trailer sequences (large diameter), and the intronic sequences 

(small diameter) respectively. The "B", "C\ and "D" boxes correspond to the sequences coding 

for the leader peptide, N-terminal latency-associated peptide and bioactive carboxyterminai 

domain of the protein respectively. Small "e", T and "g" arrows point towards the positions of the 
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primers used for intron amplification, exon amplification and sequencing and exon sequencing 
respectively; the corresponding primer sequences are reported in Table 1. The positions of the 
identified DNA sequence polymorphisms are shown as "h", T or *j w lines on the myostatin gene 
for silent, conservative and disrupting mutations respectively. Each mutation is connected via an 
5 arrow with a box reporting the details of the corresponding DNA sequence and eventually 
encoded peptide sequence. In each box, the variant sequence is compared with the control 
Holstein-Friesian sequence and differences are highlighted in color. 

Figure 4 shows the distribution of identified mutations in the various breeds 
examined. The order of the myostatin mutations correspond to Figure 3. All analyzed animals 
1 o were double-muscled except for the two Holstein-Friesian and two Jerseys used as controls 
(column 1). 

Detailed Description Of Preferred Embodiments 

The method used for isolating genes which cause specific phenotypes is known 
as positional candidate cloning. It involves: (i) the chromosomal localization of the gene which 

15 causes the specific phenotype using genetic markers in a linkage analysis; and (ii) the 

identification of the gene which causes the specific phenotype amongst the "candidate" genes 
known to be located in the corresponding region. Most of the time these candidate genes are 
selected from available mapping information in humans and mice. 

The tools required to perform the initial localization (step (i) above) are 

20 microsatellite marker maps, which are available for livestock species and are found in the public 
domain (Bishop et a/., 1994; Barendse etai, 1994; Georges ef a/., 1995; and Kappes, 1997). 
The tools required for the positional candidate cloning, particularly the YAC libraries, (step (ii) 
above) are partially available from the public domain. Genomic libraries with large inserts 
constructed with Yeast Artificial Chromosomes ("YAC") are available in the public domain for 

25 most livestock species including cattle. When cross-referencing the human and mice map, it is 
necessary to identify the positional candidate, which is available at low resolution but needs to be 
refined in every specific instance to obtain the appropriate level of high resolution. A number of 
original strategies are described herein to achieve this latter objective. For general principles of 
positional candidate cloning, see Collins, 1995 and Georges and Andersson, 1996. 

30 In order to allow for cross-referencing between the bovine and human gene 

map as part of the positional candidate cloning approach, HSA2q31-32 (map of the long arm of 
human chromosome 2, cytogenetic bands q31-32) and BTA2q12-22 (map of the arm of bovine 
chromosome 2, cytogenetic bands q12-22) were integrated on the basis of coincidence bovine 
YAC's as described below. 
35 Using a previously described experimental [(normal x double-muscled) x. double- 

muscled] backcross population comprising 108 backcross individuals, the mh locus was recently 
mapped by linkage analysis to the centromeric tip of bovine chromosome 2 (BTA2), at 3.1 
centiMorgan proximal from the last marker on the linkage map: TGLA44 (Charlier et a/., 1995). It 
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was also known from previous work that pro-a(Hi) collagen (CoI3AI) was located in the same 
chromosomal region as the mh locus. Col3AI has been mapped to BTA2q12-22 by in situ 
hybridization (Solinas-Toldo ef a/., 1995), while a Col3AI RFLP marker was shown to be closely 
linked to TGLA44 (6=2%)(Fisher et a/., 1996). This identifies the region flanking Col3AI on the 
5 human map, i.e. HSA2q31-32, as the likely orthologous human chromosome segment. This 
assumption is compatible with data from Zoo-FISH experiments (Solinas-Toldo et a/., 1995) as 
well as mapping data of Type I markers on somatic cell hybrids (O'Brien et a/., 1993), which 
establish an evolutionary correspondence between segments of HSA2q and BTA2. 

In order to refine the correspondence between the HSA2q31-33 and BTA2q12- 

10 22 maps, Comparative Anchored Tagged Sequences or CATS, i.e. primer pairs that would 

amplify a Sequence Tagged Site or STS from the orthologous gene in different species (Lyons et 
a/., 1996), were developed for a series of genes flanking Col3A1 on the human map and for 
which sequence information was available in more than one mammal. In addition to Col3At, 
working CATS were obtained for a2(V) collagen (Col5A2), inositol poiyphosphate-1 phosphatase 

1 5 (INPP1), tissue factor pathway inhibitor precursor (TFPl), titin (7TA/). n-chimaerin (CHN), 

glutamate decarboxylase 67 (GAD1), Cytotoxic T-lymphocyte-associated protein 4 (CTLA4) and 
T-cell membrane glycoprotein CD28 (CD28). The corresponding primer sequences are given in 
Table 1. 
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Tablel: 





CATS 




INPP1 


UP: 5 CAGCAAAGTCCTTAATGGTAACAAGC 3 


UN. t> Caoli t CAC 1 (jAALjAAAAOLs 1 OO 1 La o 




COL3A1 


UP: 5" CCCCATATTATGGAGATGAACCG 3' 


DN: 5* AGTTCAGGATGGCAGAATTTCAG 3* 


5 


COL5A2 


UP: 5 GCAAACTGGGYGGRAGCAAGACC 3* 


DN: S TTSTTCCTGGGCTTTTATTGAGAC 3' 




TFP1 


UP: 5' AAGCCWGATTTCTGCTTYTTGGAAG 3' 


DN: 5 TGCCMAGGCAHCCRCCRTACTTGAA 3' 




TTN 


UP: S GGTCGTCCTACACCAGAAG 3* 


DN: 5' GGTTGACATTGTCAAGAACAAG 3' 




CHN 


UP: S TCTCMAAAGTCGTCTGTGACAATC 3* 


DN: S TGYTCRTTTTCTTTCAGAGTTGC 3* 




GAD1 


UP: 5* RCTGGTCCTCTTCACCTCAGAAC 3' 


DN: 5* ACATTGTCVGTTCCAAAGCCAAG 3* 


10 


CTLA4 


UP: S AGGTYCGGGTGACDGTGCTKC 3* 


DN: 5* TGGRTACATGAGYTCCACCTTGC 3* 




CD28 


UP: 5' AGCTGCARGTATWCCTACAAYCT 3* 


DN: 5' GTYC C RTTGCTC YTCTC RTTGYC 3' 




. Microsatellite markers 




TGLA44 


UP:5' AACTGTATATTGAGAGCCTACCATG 3* 


DN: 5* CACACCTTAGCGACTAAACCACCA 3* 




BULGE27 


UP: 5' CTACCTAACAGAATGA i;l 1 I GTAAG 3' . 


DN: 5' AGTGTTCTTGCCTAGAGAATCCCAG 3' 


15 


8ULGE23 


UP: 5* ACATTCTCTCACCAATATGACATAC 3' 


DN: 5' TAAGTCACCATTACATCCTTAGAAC 3 




BM81124 


UP: 5* GCTGTAAGAATCTTCATTAAGCACT 3* 


DN: 5' CCTGATACATGCTAAGGTTAAAAAC 3" 




BULGE28 


UP: 5* AGGCATACATCTGGAGAGAAACATG 3' 


DN: 5' CAGAGGAGCCTAGCAGGCTACCGTC 3' 




BULGE20 


UP: 5' CAGCAGGTCTGTTGAAGTGTATCAG 3* 


DN: 5' AGTGGTAGCATTCACAGGTAGCCAG 3' 




BM3627 


UP: 5' CAGTCCATGGCACCATAAAG 3' 


DN: 5' TCCGTTAGTACTGGCTAATTGC 3* 


20 


ILSTS026 


UP: S CTGAATTGGCTCCAAAGGCC 3' 


DN: 5* AAACAGAAGTCCAGGGCTGC 3' 




INRA40 


UP: 5' TCAGTCTCCAGGAGAGAAAAC 3* 


DN: 5* CTCTGC CCTGGG GATG ATTG 3* 




Bovine Myostatin primers 




GDF8.19 


5' AATGTATGTTTATATTTAC CTGTTCATG 3* 






GDF8.11 


5' ACAGTGTTTGTGCAAATCCTGAGAC 3' 




25 


GDF8.12 


5' CAATGCCTAAGTTGGATTCAGGTTG 3' 






GDF8.25 


5' CTTGCTGTAACCTTCCCAGGACCAG 3* 






GDF8.15 


5' TCCCATCCAAAGGCTTCAAAATC 3* 






GDF8.21 


5' ATACTCWAGGCCTAYAGCCTGTGGT 3' 





Reading from left to right and down the table, the sequences given in Table 1 are identified as 
30 SEQ ID NO:12 to SEQ ID NO:53 f respectively. 



These CATS were used to screen a 6-genome equivalent bovine YAC library by 
PCR using a three-dimensional pooling strategy as described by Libert et a/., 1993. The same 
YAC library was also screened with all microsatellite markers available for proximal BTA2, i.e. 
TGLA44, BM81124, BM3627, ILSTS026, INRA40 and TGLA431 (Kappes etai, 1997). 

35 Potential overlap between the YACs obtained with this panel of STS's was 

explored on the basis of common STS content, as well as cross-hybridization between SINE- 
PCR product from individual YACs. From this analysis, three independent YAC contigs emerged 
in the region of interest: (i) contig A containing microsatellites TGLA44, BM81124 and Type I 
marker INPP1] (ii) contig B containing Col3AI and Col5A2\ and (iii) contig C containing 

40 microsatellite markers BM3627, ILSTS026 and INRA40, and Type i marker TFPL 

None of the available microsatellites mapped to contig B, therefore this cluster 
of YACs could not be positioned in cattle with respect to the two other contigs. Available 
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mapping information in the human, however, allowed prediction of contig B's position between 
contigs A and C. To test this hypothesis, two new microsateliite markers were isolated from 
contig B t BULGE20 and BULGE28. BULGE20 proved to be polymorphic, allowing for 
genotyping of the experimental backcross population. 
5 In addition, to increase the informativeness of the markers available for contig A, 

two new microsateliite markers were developed from this contig: BULGE23 and BULGE27. 
BULGE23 proved to be polymorphic and was used to type the same pedigree material. 

All resulting genotypes were used to construct a linkage map using the (LINK 
program (Lathrop and Lalouel, 1984). The following most likely order and sex-averaged 
10 recombination rates between adjacent markers was obtained: [TGLA44-(0%)-BULG23]-(6,1%)- 
BULG20-(1,6%)-ILSTS026-(2.3%)-INRA40-(7,1%)-TGLA431. The position of BULGE20 between 
TGLA44 and 1LSTS026 confirmed the anticipated order of the three contigs. Figure 1 
summarizes the resulting mapping information. 

A multi point linkage analysis was undertaken using LINKMAP, to position the 
15 mh locus with respect to the new marker map. Linkage analysis was performed under a simple 
recessive model, assuming full penetrance for mh/mh individuals and zero penetrance for the 
two other genotypes. The LOD score curve shown in Figure 1 was obtained, placing the mh locus 
in the TGLA44-BULGE20 interval with an associated maximum LOD score of 26.4. Three 
backcross individuals were shown to recombine with the BULGE20 and distal markers, but not 
20 with TGLA44 and BULGE23, therefore placing the mh locus proximal from this marker. One 
individual, was shown to recombine with TGLA44 and BULGE23, but not with the more distal 
markers, therefore positioning the mh locus distal from TGLA44 and BULGE23. Given the 
relative position of these microsateliite markers with respect to INPP1 and Col3AI as deduced 
from the integration of the human and bovine map, these results indicated that the mh gene is 
25 likely located in a chromosome segment bounded by INPP1 and Col3AL 

Recently, McPherron etai (1997) demonstrated that mice homozygous for a 
knock-out deletion of GDF-8 or myostatin, were characterized by a generalized increase in 
skeletal muscle mass. Using the published 2676bp murine myostatin cDNA sequence 
(GenBank accession number U84005), a Tentative Human Consensus (THC) cluster in the 
30 Unigene database was identified which represented three cDNA clones (221299, 300367, 
308202) and six EST (Expressed Sequence Tag) sequences (H92027, H92028, N80248, 
N95327, W07375, W24782). The corresponding THC covered 1296 bp of the human myostatin 
gene, showing an homology of 78.1% with the murine sequence when averaged over the entire 
sequence, and 91.1% when considering only the translated parts of the human and murine 
35 genes (566bp). This THC therefore very likely corresponds to the human orthologue of the 

murine myostatin gene. Primers (5-GGCCCAACTATGGATATATTTG-3' (SEQ ID NO:9) and 5'- 
GGTCCTGGGAAGGTTACAGCA-3' (SEQ ID NO:10)) were thus prepared to amplify a 272 bp 
fragment from the second exon of human myostatin and used to genotype the whole-genome 
Genebridge-4 radiation hybrid panel (Walter et a/., 1994). The RHMapper program (Slonim et 
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a/. f unpublished) was used to position the myostatin gene with respect to the Whitehead/MIT 
framework radiation hybrid map, placing it at position 948,7 cR of the HSA2 map with an 
associated lodscore > 3. Closer examination of the myostatin segregation vector and its 
confrontation with the vectors from ail markers located in that region (Data Release 1 1 .9, May 
5 1997) showed it to be identical to EST SGC38239 placed on the Whitehead/MIT radiation hybrid 
map (Hudson et a/., 1995) at position 946.8 cR of HSA2. This places the human myostatin gene 
on the RH-map in the interval between Col3AI (EST WI16343 - 942.5 cR) and INPP1 (EST 
L08488 - 950.2 to 951 .2 cR)(Figure 1). Myostatin therefore appeared as a very strong positional 
candidate for the mh gene, 

10 To test the potential involvement of myostatin in the determinism of double- 

muscling in cattle, primer pairs were designed based on the available mouse and human 
myostatin sequence, with the objective to amplify the entire coding sequence from bovine cDNA 
using PCR (Polymerase Chain Reaction). Whenever possible, primers were therefore positioned 
in portions of the myostatin sequence showing 100% homology between mouse and human. 

15 Two primer pairs were identified that amplified what was predicted to represent ,98.4% of the 
bovine coding sequence plus 74 bp of 3 1 untranslated sequence, in two overlapping DNA 
fragments, respectively 660 (primers GDF8.19 - GDF8.12) and 724 bp (primers GDF8.1 1 - 
GDF8.21) long. The expected DNA products were successfully amplified from cDNA 
generated from skeletal muscle of both a normal (homozygous +/+) (SEQ ID NO:1) and a 

20 double-muscled (homozygous mh/mh) (SEQ ID NO:3) animal, and cycle-sequenced on both 
strands. 

The nucleotide sequence corresponding to the normal allele presented 88.9% 
identity with the mouse myostatin sequence (SEQ ID NO:5) over a 1067 bp overlap, and 
contained the expected open reading frame encoding a protein (SEQ ID NO:2) showing 92.9% 

25 identity in a 354 amino-acid overlap with mouse myostatin (SEQ ID NO:6). As expected for a 
member of the TGFp superfamily, the bovine myostatin gene is characterized by a proteolytic 
processing site thought to mediate cleavage of the bioactive carboxy-terminal domain from the 
longer N-terminai fragment, and by nine cysteine residues separated by a characteristic spacing 
and suspected to be involved in intra- and inter-molecular disulfide bridges (McPherron and Lee, 

30 1996). 

The nucleotide sequence obtained from the mh allele was identical to the 
normal allele over its entire length, except for an 1 1 bp deletion involving nucleotides 821 to 831 
(counting from the initiation codon). This frame shifting deletion, occurring after the first cysteine 
residue of the carboxy-terminal domain, drastically disrupts the downstream amino-acid 
35 sequence and reveals a premature stop-codon after 13 amino acids, see Figure 2. The amino 
acid sequence encoded by the mutant nucleic acid sequence is identified as SEQ ID NO:4. This 
mutation disrupts the bioactive part of the molecule and is therefore very likely to be the cause of 
the recessive doubie-muscling phenotype. Following conventional nomenclature, this mutation 
will be referred to as nt821(de!11). 
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To further strengthen the assumption of the causality of this mutation, primer 
pairs flanking the deletion (Figure 2) were prepared and the corresponding DNA segment from 
alt animals from the experimental backcross population amplified. PCR was performed in the 
presence of dCTP 32 in order to radioactively label the amplification product. Amplification 
5 products were separated on denaturing polyacrylamide gels and detected by autoradiography. A 
188 bp product would be expected for the normal allele and a 177 bp product for the 
nt821(del11) allele. Correlation between phenotype and genotype was matched for the entire 
pedigree. All ten BBCB double-muscled sires were found to be homozygous for the 
nt821(del11) mutation, all 41 F1 females were heterozygous, while 53 double-muscled offspring 

1 0 were homozygous for the mutation, the remaining 55 conventional animals were heterozygous. 

To examine the distribution of the nt821(del11) mutation in different conventional 
and double-muscled breeds, a cohort of 25 normal individuals was genotyped representing two 
dairy breeds (Holstein-Friesian, Red-and -White) and a cohort of 52 double-muscled animals 
representing four breeds (BBCB, Asturiana, Maine-Anjou and Piemontese). The results are 

1 5 summarized in Table 2. All dairy animals were homozygous normal except for one Red-and- 
White bull shown to be heterozygous. The occurrence of a small fraction of individuals carrying 
the mutation in dairy cattle is not unexpected as the phenotype is occasionally described in this 
breed. In BBCB and Asturiana, all double-muscled animals were homozygous for the 
nt821(del11) deletion, pointing towards allelic homogeneity in these two breeds. Double- 

20 muscled Maine-Anjou and Piemontese animals were homozygous "normal", i.e. they did not 
show the nt821(del11) deletion but a distinct cysteine to tyrosine substitution (C313Y) in double- 
muscled Pi6dmontese animals identified by others (Kambadur et a/., 1997) was discovered. 



Table 2: 



25 



Breed 


Phenotype 


Genotype 

+/+ +/nt$21(del1 1) nt821(det1 1)/nt82l(del11) 


Belgian Blue 


DM 






29 


Asturiana 


DM 






10 


Piemontese 


DM 


8 






Maine-Anjou 


DM 


4 






Holstein-Friesian 


Normal 


13 






Red-and-White 


Normal 


12 


1 





The entire coding sequence was also determined for the myostatin gene in 
double-muscled individuals from ten European cattle breeds and a series of mutations that 
35 disrupt myostatin function were identified. 
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The coding sequence of four control Holstein-Friesian and Jersey individuals 
was identical to the previously described wild-type allele (Grobet et a/., 1997), further indicating 
that it was the genuine myostatin coding sequence being amplified, and not a non-functional 
pseudogene. 

5 Amongst the 32 double-muscled animals, seven DNA sequence variants within 

d the coding region were found, as summarized in Figure 3. 

In addition to the nt821(de!11) mutation in the third exon, described above, four 
new mutations that would be expected to disrupt the myostatin function were found. An 
insertion/deletion at position 419 counting from the initiation codon, replacing 7 base pairs with an 
1 0 apparently unrelated stretch of 1 0 base pairs, reveals a premature stop codon in the N-terminal 
latency-associated peptide at amino-acid position 140. This mutation is referred to as 
nt419(det7-ins10). Two base pair substitutions in the second exon, a C-*T transition at 
nucleotide position 610 and a G-*T transversion at nucleotide position 676, each yield a 
premature stop codon in the same N-terminal latency-associated peptide at amino-acid positions 
15 204 and 226 respectively. These mutations are called Q204X and E226X respectively. Finally, a 
G-+A transition at nucleotide position 938 results in the substitution of a cysteine by a tyrosine. 
This mutation is referred to as C313Y. This cysteine is the fifth of nine highly conserved cysteine 
residues typical of the members of the TGF-p superfamily and shared in particular by TGF-pi , - 
p2 and -p3, and inhibin-pA and -PB (McPherron & Lee, 1996). It is thought to be involved in an 
20 intramolecular disulfide bridge stabilizing the three-dimensional conformation of the bioactive 
carboxyterminal peptide. Its substitution is therefore likely to affect the structure and function of 
the protein. This C313Y has recently also been described by Kambadur etal (1997). 

A conservative phenylalanine to leucine substitution was also found at amino- 
acid position 94 in the first exon, due to a C->A transversion at nucleotide position 282 of the 
25 myostatin gene. Given the conservative nature of the amino-acid substitution, its location in the 
less conserved N-terminal latency-associated peptide, and as this mutation was observed at the 
homozygous condition in animals that were not showing any sign of exceptional muscular 
development, this mutation probably does not interfere drastically with the myostatic function of 
the encoded protein, if at all. This mutation is referred to as F94L The murine protein is 
30 characterized by a tyrosine at the corresponding amino-acid position. 

Also identified was a silent C— >T transition at the third position of the 1 38th 
cytosine codon in the second exon, referred to as nt414(C-T). 

In addition to these DNA sequence polymorphisms detected in the coding region 
of the myostatin gene, also found were four DNA sequence variants in intronic sequences which 
35 are probably neutral polymorphisms and which have been assigned the following symbols: 

nt374-51(T-C), nt374-50(G-A), nt374-16(de!1) in intron 1, and nt748-78(de!1) in intron 2 (Figure 
3). 

Figure 4 shows the observed distribution of mutations in the analysed sample 
sorted by breed. For the majority of the studied breeds, the analyzed double-muscled animals 
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were homozygous for one of the five described mutations expected to disrupt the myostatin 
function or compound heterozygotes for two of these mutations. This is compatible with the 
hypothesis that the double-muscled condition has a recessive mode of inheritance in all these 
breeds. 

5 Only in Limousin and Blonde d'Aquitaine was there no clear evidence for the 

role of myostatin loss-of-function mutations in the determinism of the observed muscular 
hypertrophy. Most Limousin animals were homozygous for the conservative F94L substitution 
which is unlikely to cause the muscular hypertrophy characterizing these animals, as discussed 
above. One Limousin animal proved to be heterozygous for this mutation, the other allele being 

10 the "wild-type" one. All Blonde d'Aquitaine animals were homozygous "wild-type". These data 
indicate either that the myostatin gene is possibly not involved in the double-muscled condition 
characterizing these two breeds, or that there are additional myostatin mutations outside of the 
coding region. The double-muscling condition is often considered to be less pronounced in 
Limousin animals compared to other breeds. 

1 5 The data indicate that some mutations, such as the nt821del(1 1) and C313Y, 

are shared by several breeds which points towards gene migration between the corresponding 
populations, while others seems to be confined to specific breeds. Moreover, while some breeds 
(the Belgian Blue breed in particular) seem to be essentially genetically homogeneous others 
show clear evidence for allelic heterogeneity (e^g. Maine-Anjou). 

20 The observation of allelic heterogeneity contradicts with the classical view that a 

single mh mutation spread through the European continent in the beginning of the 19th century 
with the dissemination of the Shorthorn breed from the British Isles (Menissier, 1982). Two of 
the mutations at least are shared by more than one breed, indicating some degree of gene 
migration but definitely not from a single origin. 

25 In mice, and in addition to the in vitro generated myostatin knock-out mice 

(McPherron & Lee, 1997), the compact mutation could be due to a naturally occurring mutation 
at the myostatin gene. The compact locus has been mapped to the D1Mit375~D1Mit21 interval 
on mouse chromosome 1 known to be orthologous to HSA2q31-32 and BTA2q12-22 (Varga et 
a/., 1997). 

30 From an applied point of view, the characterisation of a pane! of mutations in 

the myostatin gene associated with double-muscling contributes to the establishment of a 
diagnostic screening system allowing for marker assisted selection for or against this condition in 
cattle. 

Example 1 
35 Genetic and physical mapping 

Integration of the HSA2q31-32 and BTA2q12-22 maps was done by using 
coincident YAC's and the mh locus was positioned in the interval flanked by Col3AI and INPP1 as 
follows. Genetic mapping was performed using a previously described (Holstein-Friesian x 
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Belgian Blue) x Belgian Blue experimental backcross population counting 108 informative 
individuals (Charlier et a/., 1995). Microsatellite genotyping was performed according to standard 
procedures (Georges et a/., 1995) f using the primer sequences reported in Table 1. Linkage 
- analyses were performed with the MLINK, ILINK and LINKMAP programs of the LINKAGE 
5 (version 5.1) and FASTLINK (2.3P version, June 1995) packages (Lathrop & Lalouel, 1984; 
Cottingham et al. t 1 993). The YAC library was screened by PCR using a three dimensional 
pooling scheme as described in Libert et a/., 1993. The primer pairs corresponding to the CATS 
used to screen the library are reported in Table 1 . Cross-hybridisation between SINE-PCR 
products of individual YACs was performed according to Hunter et al. (1 996), using primers 
10 reported in Lenstra et al. (1993). Microsatellites were isolated from YACs according to Cornells 
et al. (1992). 

Example 2 

Mapping of the human myostatin gene on the Genebridge-4-panei 

DNA from the Genebridge-4 panel (Walter et al., 1994) was purchased from 
15 Research Genetics (Huntsville t Alabama), and genotyped by PCR using standard procedures 
and the following human myostatin primer pair (5-G G CCC AACTATG G ATATATTTG -3 ' and 5- 
G GTC CTG G G AAG GTTAC AG C A-3 ') . Mapping was performed via the WWW server of the 
Whitehead Institute/MIT Center for Genome Research using their RH-mapper program (Slonim, 
D.; Stein, L.; Kruglyak, L.; Lander, E., unpublished) to position the markers with respect to the 
20 framework map. Segregation vectors of the query markers were compared with the vectors from 
all markers in the region of interest in the complete Data Release 1 1 .9 (May 1997) to obtain a 
more precise position. This positions myostatin in the INPP1-Col3Al on the human map with LOD 
score superior to 3. 

Example 3 
25 RT-PCR 

To clone the bovine myostatin orthologue a strategy based on RT-PCR 
amplification from skeletal muscle cDNA was chosen. Total RNA was extracted from skeletal 
muscle (Triceps brachialis) according to Chirgwin et al. (1979). RT-PCR was performed using 
the Gene-Amp RNA PCR Kit (Perkin-Elmer) and the primers reported in Table 1 . The PCR 
30 products were purified using QiaQuick PCR Purification kit (Qiagen) and sequenced using Dye 
terminator Cycle Sequencing Ready Reaction (Perkin-Elmer) and an ABI373 automatic 
sequencer, using the primers reported in Table 2. 

Example 4 

Diagnosis of the nt821(de!11) deletion 

35 To diagnose the nt821(de!11) the following primer sequences were designed 

flanking the nt821(del1 1) deletion: S'-TCTAGGAGAGATTTTGGGCTT-S' (SEQ ID NO:53) and 5- 
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G ATG G GTATG AG G ATACTTTTG C-3* (SEQ ID NO:52). These primers amplify a 188 bp 
fragments from normal individuals and a 177bp fragment from double-muscled individuals. 
Heterozygous individuals show the two amplification products. These amplification products can 
be detected using a variety of methods. In this example the PCR product was labelled by 
5 incorporation of dCTP 32 , separated on a denaturing acrylamide gel and revealed by 
autoradiography. Other approaches that could be used to distinguish the three different 
genotypes are known to those skilled in the art and would include separation in agarose gels and 
visualization with ethidium bromide, direct sequencing, TaqMan assays, hybridization with allele 
specific oligonucleotides, reverse dot-blot, RFLP analysis and several others. The specificity of 
1 0 the test is linked to the detected mutation and not to the primers used in the detection method. 
That means that other primers can easily be designed based on said bovine myostatin sequence 
that would fulfill the same requirements. 

Example 5 

Determination of mutations in other breeds 

15 A total of 32 animals with extreme muscular development were sampled from 

ten European beef cattle breeds in which double-muscled animals are known to occur at high to 
moderate frequency: (i) Belgium: Belgian Blue (4), (ii) France: Blonde d'Aquitaine (5), Charolais 
(2), Gasconne (2), Limousin (5), Maine-Anjou (4), Parthenaise (3), (iii) Spain: Asturiana (2), 
Rubia Gallega (2), (iv) Italy: Piedmontese (2). The determination of the double-muscled 

20 phenotype of the sampled animals was performed visually by experienced observers. Four 

animals with conventional phenotype sampled from the Hoistein-Friesian (2) and Jersey (2) dairy 
populations were analysed as controls. 

In order to facilitate the study of the myostatin coding sequence from genomic 
DNA, the sequences of the exon-intron boundaries of the bovine gene were determined. In mice, 

25 the myostatin gene is known to be interrupted by two introns, respectively ~1 .5 and 2.4 Kb long 
(McPherron & Lee, 1997). Two primer pairs were thus designed, respectively, in bovine exons 1 
and 2, and exons 2 and 3, that were predicted to flank the two introns, assuming conservation of 
gene organisation between mouse and cattle (Figure 3 and Table 3). Using these primer sets, 
two PCR products respectively 2Kb and 3.5Kb long were generated from a YAC clone (1 79A3) 

30 containing the bovine myostatin gene (Grobet et a/., 1997). The PCR products were purified 

using QiaQuick PCR Purification kit (Qiagen) and partially sequenced using Dye terminator Cycle 
Sequencing Ready Reaction (Perkin-Elmer) and an ABI373 automatic sequencer. Alignment 
with the bovine cDNA sequence identified the four predicted exon-intron boundaries. The 
nucleotide sequence corresponding to bovine genomic DNA is identified as SEQ ID NO:54. 
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Table 3: Primers used for PGR amplification and cycle sequencing. 



lntron1-5' 


5-GAAG AC GATG ACTAC CAC 
GCCAGGACG-3* 


Intron1-3* 


5*-CTAGTTTATTGTATTGTATCTT 
AGAGC-3* 


lntron2-5' 


S-AGACTCCTACAACAGTGT 
TTGT-3* 


tntron2-3* 


S'-ATACTCWAGGCCTAYAGCCT 
GTGGT-3* 


Exonl-5 1 


ff-ATTCACTGGTGTGGCAAG 
TTGTCTCTCAGA-3 1 


Exon1-3' 


S-CCCTCCTCCTTACATACAAGC 
CAGCAG-3* 


Exon2-5' 


5-GTTCATAGATTGATATGG 
AGGTGTTCG-3* 


Exon2-3' 


5-ATAAGCACAGGAAACTGGTAG 
TTATT-3' 


Exon3-5* 


S-GAAATGTGAC ATAAG C AA 
AATGATTAG-3' 


Exon3-3* 


S'-ATACTCWAGGCCTAYAGCCT 
GTGGT-3* 


Exon1-Seq1 


5-TTGAGGATGTAGTG I 1 1 1 
CC-3' 


Exon1-Seq2 


S'-GCCATAAAAATCCAAATCCTC 
AG-3' 


Exon2-Seq1 


5'-CATTTATAGCTGATCTTC 
TAACGCAAG-3' 


Exon2-Seq2 


S'-TGTCGCAGGAGTCTTGACAG 
GCCTCAG-3* 


Exon2-Seq3 


5-GTACAAGGTATACTGGAA 
TCCGATCTC-3' 






Exon3-Seq1 


5-AGCAGGGGCCGGCTGAA 
CCTCTGGG-3' 


Exon3-Seq2 


5*-CCCCAGAGGTTCAGCCGGCC 
CCTGC-3* 



Based on the available exonic and intronic sequences of the bovine myostatin 
gene, three primer pairs that jointly allow for convenient amplification of the entire coding 
sequence from genomic DNA were designed. The position of the corresponding primers is 
shown in Figure 3, and the corresponding sequences are reported in Table 3. 

1 5 After PCR amplification of the entire coding sequence from genomic DNA in the 

three described fragments, these were purified using QiaQuick PCR Purification kit (Qiagen) and 
sequenced using Dye terminator Cycle Sequencing Ready Reaction (Perkin-Elmer) and an 
ABI373 automatic sequencer, using the primers used for amplification as well as a series of 
nested primers (Figure 3 and Table 3). Chromat files produced with the ABI373 sequencer were 

20 analysed with the Polyphred application (D. Nickerson, personal communication), which is part of 
a series of sequence analysis programs including Phred (Ewing, B. & Green, P. (1992), 
unpublished), Phrap (Green, P. (1994), unpublished) and Consed (Gordon, D. (1995), 
unpublished), but any suitable sequencing programme would do, as known to a person skilled in 
the art. 

25 Monoclonal antibodies (Mab's) specific for myostatin are useful. In the case of 

the bovine protein having the amino acid sequence identified as SEQ ID NO:2 f for example, 
antibodies can be used for diagnostic purposes such as for determining myostatin protein levels 
in muscle tissue. To produce these antibodies, purified myostatin is prepared. The myostatin 
can be produced in bacterial cells as a fusion protein with glutathione-S-transferase using the 

30 vector pGEX2 (Pharmacia). This permits purification of the fusion protein by GSH affinity 
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chromatography. In another approach, myostatin is expressed as a fusion protein with the 
bacterial maltose binding domain. The fusion protein is thus recovered from bacterial extracts by 
passing the extract over an amylose resin column followed by elution of the fusion protein with 
maltose. For this fusion construct, the vector pMalC2 ( commercially available from New England 
5 Biolabs, can be used. The preparation of a second fusion protein is also useful in the preliminary 
screening of MAb's. 

The generation of hybridomas expressing monoclonal antibodies recognizing 
myostatin protein is carried out as follows: BALB/c mice are injected intraperitoneal^ with 
protein/adjuvant three times at one-month intervals, followed by a final injection into the tail vein 
10 shortly prior to cell fusion. Spleen cells are harvested and fused with NS-1 myeloma cells 

(American Type Culture Collection, Rockville, MD) using polyethylene glycol 4000 according to 
standard protocols (Kennett, 1979; Mirski, 1989). The cell fusion process is carried out as 
described in more detail below. 

The fused ceils are plated into 96-well plates with peritoneal exudate cells and 
1 5 irradiated spleen ceils from BALB/Ccmice as feeder layers and selection with hypoxanthine, 
aminopterin, and thymidine (HAT medium) is performed. 

An ELISA assay is used as an initial screening procedure. 1-10 pg of purified 
myostatin (cleaved from the fusion protein) in PBS is used to coat individual wells, and 50-100 pi 
per well of hybridoma supernatants is incubated. Horseradish peroxidase-conjugated anti- 
20 mouse antibodies are used for the colorimetric assay. 

Positive hybridomas are cloned by limiting-dilution and grown to large-scale for 
freezing and antibody production. Various positive hybridomas are selected for usefulness in 
western blotting and immunohistochemistry, as well as for cross reactivity with myostatin proteins 
from different species, for example the mouse and human proteins. 
25 Alternatively, active immunization by the generation of an endogenous antibody 

by direct exposure of the host animal to small amounts of antigen can be carried out. Active 
immunization involves the injection of minute quantities of antigen (g) which probably will not 
induce a physiological response and will be degraded rapidly. Antigen will only need to be 
administered as prime and boost immunizations in much the same manner as techniques used 
30 to confer disease resistance (Pell et a/., 1997). 

Antisense nucleic acids or oligonucleotides (RNA or preferably DNA) can be 
used to inhibit myostatin production in order to increase muscle mass of an animal. Antisense 
oligonucleotides, typically 15 to 20 bases long, bind to the sense mRNA or pre mRNA region 
coding for the protein of interest, which can inhibit translation of the bound mRNA to protein. The 
35 cDNA sequence encoding myostatin can thus be used to design a series of oligonucleotides 
which together span a large portion, or even the entire cDNA sequence. These oligonucleotides 
can be tested to determine which provides the greatest inhibitory effect on the expression of the 
protein (Stewart, 1996). The most suitable mRNA target sites include 5'- and 3-untranslated 
regions as well as the initiation codon. Other regions might be found to be more or less effective. 
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Alternatively, an antisense nucleic acid or oligonucleotide may bind to myostatin coding or 
regulatory sequences. 

Rather than reducing myostatin activity by inhibiting myostatin gene expression 
at the nucleic acid level, activity of the myostatin protein may be directly inhibited by binding to an 
5 agent, such as, for example, a suitable small molecule or a monoclonal antibody. 

It will of course be understood, without the intention of being limited thereby, that 
a variety of substitutions of amino acids is possible while preserving the structure responsible for 
myostatin activity of the proteins disclosed herein. Conservative substitutions are described in the 
patent literature, as for example, in United States Patent No. 5,264,558 or 5,487,983. It is thus 

10 expected, for example, that interchange among non-polar aliphatic neutral amino acids, glycine, 
alanine, proline, valine and isoleucine, would be possible. Likewise, substitutions among the 
polar aliphatic neutral amino acids, serine, threonine, methionine, asparagine and glutamine 
could possibly be made. Substitutions among the charged acidic amino acids, aspartic acid and 
glutamic acid, could probably be made, as could substitutions among the charged basic amino 

15 acids, lysine and arginine. Substitutions among the aromatic amino acids, including 

phenylalanine, histidine, tryptophan and tyrosine would also likely be possible. These sorts of 
substitutions and interchanges are well known to those skilled in the art. Other substitutions 
might well be possible. Of course, it would also be expected that the greater the percentage, of 
homology, i.e., sequence similarity, of a variant protein with a naturally occurring protein, the 

20 greater the retention of metabolic activity. Of course, as protein variants having the activity of 
myostatin as described herein are intended to be within the scope of this invention, so are 
nucleic acids encoding such variants. 

A further advantage may be obtained through chimeric forms of the protein, as 
known in the art. A DNA sequence encoding the entire protein, or a portion of the protein, could 

25 thus be linked, for example, with a sequence coding for the C-terminal portion of E. coli S- 

galactosidase to produce a fusion protein. An expression system for human respiratory syncytial 
virus glycoproteins F and G is described in United States Patent No. 5,288,630 issued February 
22, 1994 and references cited therein, for example. 

A recombinant expression vector of the invention can be a plasmid, as described 

30 above. The recombinant expression vector of the invention further can be a virus, or portion 
thereof, which allows for expression of a nucleic acid introduced into the viral nucleic acid. For 
example, replication defective retroviruses, adenoviruses and adeno-associated viruses can be 
used. 

The recombinant expression vectors of the invention can be used to make a 
35 transformant host cell including the recombinant expression vector. The term "transformant host 
ceir is intended to include prokaryotic and eukaryotic cells which have been transformed or 
transfected with a recombinant expression vector of the invention. The terms "transformed with", 
"transfected with", "transformation" and "transfection" are intended to encompass introduction of 
nucleic acid (e.g. a vector) into a cell by one of many possible techniques known in the art. 
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Prokaryotic cells can be transformed with nucleic acid by, for example, electroporation or 
calcium-chloride mediated transformation. Nucleic acid can be introduced into mammalian cells 
via conventional techniques such as calcium phosphate or calcium chloride coprecipitation, 
DEAE-dextran-mediated transfection, lipofection, electroporation or microinjection. Suitable 
5 methods for transforming and transfecting host cells are known (Sambrook, 1 989). 

The number of host celts transformed with a recombinant expression vector of 
the invention by techniques such as those described above will depend upon the type of 
recombinant expression vector used and the type of transformation technique used. Plasmid 
vectors introduced into mammalian cells are integrated into host cell DNA at only a low 
1 0 frequency. In order to identify these integrants, a gene that contains a selectable marker (e.g. 
resistance to antibiotics) is generally introduced into the host cells along with the gene of interest. 
Preferred selectable markers include those which confer resistance to certain drugs, such as 
G418 and hygromycin. Selectable markers can be introduced on a separate plasmid from the 
nucleic acid of interest or, preferably, are introduced on the same plasmid. Host cells 
1 5 transformed with one or more recombinant expression vectors containing a nucleic acid of the 
invention and a gene for a selectable marker can be identified by selecting for cells using the 
selectable marker. For example, if the selectable marker encodes a gene conferring neomycin 
resistance (such as pRc/CMV), transformant cells can be selected with G41 8. Cells that have 
incorporated the selectable marker gene will survive, while the other cells die. 
20 Nucleic acids which encode myostatin proteins can be used to generate 

transgenic animals. A transgenic animal (e.g., a mouse) is an animal having cells that contain a 
transgene, which transgene is introduced into the animal or an ancestor of the animal at a 
prenatal, e.g., an embryonic stage. A transgene is a DNA which is integrated into the genome of 
a cell from which a transgenic animal develops. In one embodiment, a bovine cDNA, comprising 
25 the nucleotide sequence shown in SEQ ID NO:1 , or an appropriate variant or subsequence 
thereof, can be used to generate transgenic animals that contain cells which express bovine 
myostatin. Likewise, variants such as mutant genes (e.g. SEQ ID NO;3) can be used to generate 
transgenic animals. This could equally well be done with the human myostatin protein and 
variants thereof. "Knock out" animals, as described above, can also be generated. Methods for 
30 generating transgenic animals, particularly animals such as mice, have become conventional in 
the art are described, for example, in U.S. Patent Nos. 4,736,866 and 4,870,009. In a preferred 
embodiment, plasmids containing recombinant molecules of the invention are microinjected into 
mouse embryos. In particular, the plasmids are microinjected into the male pronuclei of fertilized 
one-cell mouse eggs; the injected eggs are transferred to pseudo-pregnant foster females; and, 
35 the eggs in the foster females are allowed to develop to term. (Hogan, 1986). Alternatively, an 
embryonal stem cell line can be transfected with an expression vector comprising nucleic acid 
encoding a myostatin protein, and cells containing the nucleic acid can be used to form 
aggregation chimeras with embryos from a suitable recipient mouse strain. The chimeric 
embryos can then be implanted into a suitable pseudopregnant female mouse of the appropriate 



99C2667A1 I > 



WO 99/02667 



PCT/IB98/01197 



-25- 

strain and the embryo brought to term. Progeny harboring the transfected DNA in their germ 
cells can be used to breed uniformly transgenic mice. 

Such animals could be used to determine whether a sequence related to an 
intact myostatin gene retains biological activity of myostatin. Thus, for example, mice in which 
5 the murine myostatin gene has been knocked out and containing the nucleic acid sequence 
identified as SEQ ID NO:1 could be generated along with animals containing the nucleic acid 
sequence identified as SEQ ID NO:3. The animals could be examined for display of muscular 
hyperplasia, especially in comparison with knockout mice, which are known to display such. In 
this way it can be shown that the protein encoded by SEQ ID NO:3 lacks myostatin activity within 
10 the context of this invention while the protein encoded by the nucleic acid sequence identified as 
SEQ ID NO:1 possesses biological activity of myostatin. 

In such experiments, muscle cells would be particularly targeted for myostatin 
(and variants) transgene incorporation by use of tissue specific enhancers operatively linked to 
the encoding gene. For example, promoters and/or enhancers which direct expression of a 
15 gene to which they are operatively linked preferentially in muscle cells can be used to create a 
transgenic animal which expresses a myostatin protein preferentially in muscle tissue. 
Transgenic animals that include a copy of a myostatin transgene introduced into the germ line of 
the animal at an embryonic stage can also be used to examine the effect of increased myostatin 
expression in various tissues. 
20 The pattern and extent of expression of a recombinant molecule of the invention 

in a transgenic mouse is facilitated by fusing a reporter gene to the recombinant molecule such 
that both genes are co-transcribed to form a polycistronic mRNA. The reporter gene can be 
introduced into the recombinant molecule using conventional methods such as those described 
in Sambrook et a/. f (Sambrook, 1989). Efficient expression of both cistrons of the polycistronic 
25 mRNA encoding the protein of the invention and the reporter protein can be achieved by 

inclusion of a known internal translational initiation sequence such as that present in poliovirus 
mRNA. The reporter gene should be under the control of the regulatory sequence of the 
recombinant molecule of the invention and the pattern and extent of expression of the gene 
encoding a protein of the invention can accordingly be determined by assaying for the phenotype 
30 of the reporter gene. Preferably the reporter gene codes for a phenotype not displayed by the 
host cell and the phenotype can be assayed quantitatively. Examples of suitable reporter genes 
include lacZ (p-galactosidase), neo (neomycin phosphotransferase), CAT (chloramphenicol 
acetyltransferase) dhfr (dihydrofolate reductase), aphlV (hygromycin phosphotransferase), lux 
(luciferase), uidA (p-glucuronidase). Preferably, the reporter gene is lacZ which codes for p- 
35 galactosidase. p-galactosidase can be assayed using the lactose analogue X-gal (5-bromo-4- 
chloro-3-indolyi-b-D-galactopyranoside) which is broken down by p-gaiactosidase to a product 
that is blue in color (Old). 

The present invention includes knocking out wild type myostatin in mammals, in 
order to obtain the desired effect(s) thereof. This is particularly true in cattle raised for beef 
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production. It may well prove advantageous to substitute a defective gene (e.g. SEQ ID NO:3 or 
it genomic analogue) rather than delete the entire sequence of DNA encoding for a protein 
having myostatin activity. A method of producing a transgenic bovine or transgenic bovine 
embryo is described in United States Patent No. 5,633,076, issued May 27, 1997, for example. 
5 The transgenic animals of the invention can be used to investigate the molecular 4 

basis of myostatin action. For example, it is expected that myostatin mutants in which one or 
more of the conserved cysteine residues has been deleted would have diminished activity in 
relation to a wild type myostatin protein in which all such residues are retained. Further, deletion 
of proteolytic cleavage site would likely result in a mutant lacking biological activity of myostatin. 
tO Transgenesis can be used to inactivate myostatin activity. This could be 

achieved using either conventional transgenesis, i.e. by injection in fertilized oocytes, or by gene 
targeting methods using totipotent cell lines such as ES (embryonic stem cells) which can then 
be injected in oocytes and participate in the development of the resulting organisms or whose 
nucleus can be transferred into unfertilized oocytes, nucleus transfer or cloning. 
15 it is also possible to create a genetically altered animal in which the double- 

muscling trait is dominant so that the animal would be more useful in cross-breeding. Further, in 
a particular aspect, the dominant trait would be male specific. In this way, bulls would be double- 
muscled but cows would not be. In addition, or alternatively, the trait would also be unexpressed 
until after birth or inducible. If inducible the trait could be induced after birth to avoid the calving 
20 difficulties described above. 

There are at least three approaches that can be taken to create a dominant 
u mh" allele. Because functional myostatin, a member of the TGF-fc superfamily, is a dimer, 
dominant negative myostatin mutations can be created (Herskowitz et a/., 1987; Lopez et a/., 
1992). According to one method, this is accomplished by mutating the proteolytic processing 
25 site of myostatin. To enhance the dominant negative effect, the gene can be put under the 
control of a stronger promoter such as the CMV promoter or that of a myosin gene, which is 
tissue specific, i.e., expressed only in skeletal muscle. Alternatively, an antisense sequence of 
that encoding myostatin could be incorporated into the DNA, so that complementary mRNA 
molecules are generated, as understood by a person skilled in the art. Optionally, a ribozyme 
30 could be added to enhance mRNA breakdown. In another approach, ere recombinase 

generate/ribozyme approach or the Cre-lox P system could be used (Hoess et a/., 1982; Gu et 
a/., 1994). 

Male specificity can be achieved by placing the dominant mh alleles on the Y 
chromosome by homologous recombination. 
35 Inducibiiity can be achieved by choosing promoters with post-natal expression in 

skeletal muscle or using inducible systems such at he Tet-On and Tet-Off systems could be 
used (Gossen etal., 1992; Shockett ef a/., 1996). 

Using conventional transgenesis a gene coding for a myostatin antisense is 
injected, for example, by inverting the orientation of the myostain gene in front of its natural 
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promoter and enhancer sequences. This is followed by injection of a gene coding for an anti- 
myostain ribozyme, i.e. an RNA that would specifically bind to endogenous myostain mRNA and 
destroy it via its "ribozyme" activity. 

Also, through gene targeting, a conventional knock-out animal can be 
5 generated, specific mutations by gene replacement can be engineered. It is possible to 

inactivate the myostain gene at a specific developmental time, such as after birth to avoid calving 
difficulties. As mentioned above, this could be achieved using the Cre-lox P systems in which 
1.ox P sides are engineered around the myostain gene by homologous recombination (gene 
targeting), and mating these animals with transgenic animals having a Cre transgene (coding for 
1 0 the Cre recombinase existing DNA flanked by J oxP sides) under the dependence of a skeletal 
muscle specific promoter only active after birth. This is done to obtain individuals that would 
inactivate their myostain gene after birth. As mentioned above, there are also gene targeting 
systems that allow genes to be turned on and off by feeding an animal with, for example, an 
antibiotic. In such an instance, one engineers an operator between the promoter of the gene and 
1 5 the gene itself. This operator is the target of a repressor which when binding inactivates the gene 
(for example, the lac operon in E. coif). The repressor is brought into the cell using conventional 
transgenesis, for example, by injection of the gene coding for the repressor. 

Transgenic animals of the invention can also be used to test substances for the 
ability to prevent, slow or enhance myostatin action. A transgenic animal can be treated with the 
20 substance in parallel with an untreated control transgenic animal. 

The antisense nucleic acids and oligonucleotides of the invention are useful for 
inhibiting expression of nucleic acids (e.g. mRNAs) encoding proteins having myostatin activity. 

The isolated nucleic acids and antisense nucleic acids of the invention can be 
used to construct recombinant expression vectors as described previously. These recombinant 
25 expression vectors are then useful for making transformant host cells containing the recombinant 
expression vectors, for expressing protein encoded by the nucleic acids of the invention, and for 
isolating proteins of the invention as described previously. The isolated nucleic acids and 
antisense nucleic acids of the invention can also be used to construct transgenic and knockout 
animals as described previously. 
30 The isolated proteins of the invention are useful for making antibodies reactive 

against proteins having myostatin activity, as described previously. Alternatively, the antibodies of 
the invention can be used to isolate a protein of the invention by standard immunoaffinity 
techniques. Furthermore, the antibodies of the invention, including bispecific antibodies are 
useful for diagnostic purposes. 
35 Molecules which bind to a protein comprising an amino acid sequence shown in 

SEQ ID NO:2 can also be used in a method for killing a cell which expresses the protein, wherein 
the cell takes up the molecule, if for some reason this were desirable. Destruction of such cells 
can be accomplished by labeling the molecule with a substance having toxic or therapeutic 
activity. The term "substance having toxic or therapeutic activity" as used herein is intended to 
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inclucle molecules whose action can destroy a cell, such as a radioactive isotope, a toxin (e.g. 
diphtheria toxin or ricin), or a chemotherapeutic drug, as well as cells whose action can destroy a 
cell, such as a cytotoxic cell. The molecule binding to the myostatin can be directly coupled to a 
substance having a toxic or therapeutic activity or may be indirectly linked to the substance. In 
5 one example, the toxicity of the molecule taken up by the cell is activated by myostatin protein. 

The invention also provides a diagnostic kit for identifying cells comprising a 
molecule which binds to a protein comprising an amino acid sequence shown in SEQ ID NO:2, 
for example, for incubation with a sample of tumor cells; means for detecting the molecule 
bound to the protein, unreacted protein or unbound molecule; means for determining the amount 
1 0 of protein in the sample; and means for comparing the amount of protein in the sample with a 
standard. Preferably, the molecule is a monoclonal antibody. In some embodiments of the 
invention, the detectability of the molecule which binds to myostatin is activated by said binding 
(e.g., change in fluorescence spectrum, loss of radioisotopic label). The diagnostic kit can also 
contain an instruction manual for use of the kit. 
1 5 The invention further provides a diagnostic kit for identifying cells comprising a 

nucleotide probe complementary to the sequence, or an oligonucleotide fragment thereof, 
shown in SEQ ID NO:1 , for example, for hybridization with mRNA from a sample of cells, e.g., 
muscle cells; means for detecting the nucleotide probe bound to mRNA in the sample with a 
standard. In a particular aspect, the invention is a probe having a nucleic acid molecule 
20 sufficiently complementary with a sequence identified as SEQ ID NO:1 , or its complement, so as 
to bind thereto under stringent conditions. "Stringent hybridization conditions" takes on its 
common meaning to a person skilled in the art here. Appropriate stringency conditions which 
promote nucleic acid hybridization, for example, 6x sodium chloride/sodium citrate (SSC) at 
about 45°C are known to those skilled in the art. The following examples are found in Current 
25 Protocols in Molecular Biology, John Wiley & Sons, NY (1989), 6.3.1-6.3.6: For 50 ml of a first 
suitable hybridization solution, mix together 24 ml formamide, 12 ml 20x SSC, 0.5 mi 2 M 
Tris-HCI pH 7.6, 0.5 ml 100x Denhardt's solution, 2.5 ml deionized H 2 0, 10 ml 50% dextran 
sulfate, and 0.5 ml 10% SDS. A second suitable hybridization solution can be 1% crystalline 
BSA (fraction V), 1 mM EDTA, 0.5 M Na 2 HPC 4 pH 7.2, 7% SDS. The salt concentration in the 
30 wash step can be selected from a low stringency of about 2x SSC at 50°C to a high stringency of 
about 0.2x SSC at SOX. Both of these wash solutions may contain 0.1% SDS. In addition, the 
temperature in the wash step can be increased from low stringency conditions at room 
temperature, about 22°C, to high stringency conditions, at about 65°C. The cited reference 
gives more detail, but appropriate wash stringency depends on degree of homology and length 
35 of probe. If homology is 100%, a high temperature (65°C to 75°C) may be used. If homology is 
low, lower wash temperatures must be used. However, if the probe is very short (<1 00bp), lower 
temperatures must be used even with 100% homology, in general, one starts washing at low 
temperatures (37°C to 40°C), and raises the temperature by 3-5°C intervals until background is 
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low enough not to be a major factor in autoradiography. The diagnostic kit can also contain an 
instruction manual for use of the kit. 

The invention provides a diagnostic kit which can be used to determine the 
genotype of mammalian genetic material, for example. One kit includes a set of primers used 
5 for amplifying the genetic material. A kit can contain a primer including a nucleotide sequence 
for amplifying a region of the genetic material containing one of the naturally occurring mutations 
described herein. Such a kit could also include a primer for amplifying the corresponding region 
of the normal gene that produces functional myostatin. Usually, such a kit would also include 
another primer upstream or downstream of the region of interest complementary to a coding 
10 and/or non-coding portion of the gene. A particular kit includes a primer selected from a non- 
coding sequence of a myostatin gene. Examples of such primers are provided in Table 3 P 
designated as Exon1-5\ Exon1-3\ Exon2-5\ Exon3-5' and Exon3-3'. These primers are used to 
amplify the segment containing the mutation of interest. The actual genotyping is carried out 
using primers that target specific mutations described herein and that could function as allele- 
15 specific oligonucleotides in conventional hybridization, Taqman assays, OLE assays, etc. 
Alternatively, primers can be designed to permit genotyping by microsequencing. 

One kit of primers thus includes first, second and third primers, (a), (b) and (c), 
respectively. Primer (a) is based on a region containing a myostatin mutation, for example a 
region of the myostatin gene spanning the nt821det(11) deletion. Primer (b) encodes a region 
20 upstream or downstream of the region to be amplified by primer (a) so that genetic material 
containing the mutation is amplified, by PGR, for example, in the presence of the two primers. 
Primer (c) is based on the region corresponding to that on which primer (a) is based, but lacking 
the mutation. Thus, genetic material containing the non-mutated region will be amplified in the 
presence of primers <b) and (c). Genetic material homozygous for the wild type gene will thus 
25 provide amplified products in the presence of primers (b) and (c). Genetic material homozygous 
for the mutated gene will thus provide amplified products in the presence of primers (a) and (b). 
Heterozygous genetic material will provide amplified products in both cases. 

The invention provides purified proteins having biological activity of myostatin. 
The terms "isolated" and "purified" each refer to a protein substantially free of cellular material or 
30 culture medium when produced by recombinant DNA techniques, or chemical precursors or 
other chemicals when chemically synthesized. In certain preferred embodiments, the protein 
having biological activity of myostatin comprises an amino acid sequence identified as SEQ ID 
NO:2. Furthermore, proteins having biological activity of myostatin that are encoded by nucleic 
acids which hybridize under stringent conditions, as discussed above, to a nucleic acid 
35 comprising a nucleotide sequence identified as SEQ ID NO:1 or SEQ ID NO:7 are encompassed 
by the invention. Proteins of the invention having myostatin activity can be obtained by expression 
in a suitable host cell using techniques known in the art. Suitable host cells include prokaryotic 
or eukaryotic organisms or cell lines, for example, yeast, E. co// f insect cells and COS 1 cells. 
The recombinant expression vectors of the invention, described above, can be used to express a 



EMSOOCID: <WO 990P.667A1 J_> 



WO 99/02667 PCT/IB98/01 197 

-30- 

protein having myostatinl activity in a host cell in order to isolate the protein. The invention 
provides a method of preparing an purified protein of the invention comprising introducing into a 
host cell a recombinant nucleic acid encoding the protein, allowing the protein to be expressed in 
the host cell and isolating and purifying the protein. Preferably, the recombinant nucleic acid is a 
5 recombinant expression vector. Proteins can be isolated from a host cell expressing the protein 
and purified according to standard procedures of the art, including ammonium sulfate 
precipitation, column chromatography (e.g. ion exchange, gel filtration, affinity chromatography, 
etc.), electrophoresis, and ultimately, crystallization (see generally, "Enzyme Purification and 
Related Techniques", Methods in Enzymology, 22, 233-577 (1 971)). 
1 o Alternatively, the protein or parts thereof can be prepared by chemical synthesis 

using techniques well known in the chemistry of proteins such as solid phase synthesis 
(Merrifield, 1 964), or synthesis in homogeneous solution (Houbenwcyl, 1 987). 

The protein of the invention, or portions thereof, can be used to prepare 
antibodies specific for the proteins. Antibodies can be prepared which bind to a distinct epitope in 
1 5 an unconserved region of a particular protein. An unconsented region of the protein is one which 
does not have substantial sequence homology to other proteins, for example other members of 
the myostatin family or other members of the TGFp superfamiiy. Conventional methods can be 
used to prepare the antibodies. For example, by using a peptide of a myostatin protein, 
polyclonal antisera or monoclonal antibodies can be made using standard methods. A mammal, 
20 (e.g. a mouse, hamster, or rabbit) can be immunized with an immunogenic form of the peptide 
which elicits an antibody response in the mammal. Techniques for conferring immunogenicity on 
a peptide include conjugation to carriers or other techniques well known in the art. For example, 
the peptide can be administered in the presence of adjuvant. The progress of immunization can 
be monitored by detection of antibody titers in plasma or serum. Standard ELISA or other 
25 immunoassay can be used to assess the levels of antibodies. Following immunization, antisera 
can be obtained and, if desired, polyclonal antibodies isolated from the sera. 

To produce monoclonal antibodies, antibody producing cells (lymphocytes) can 
be harvested from an immunized animal and fused with myeloma cells by standard somatic ceil 
fusion procedures, thus immortalizing these cells and yielding hybridoma cells. Such techniques 
30 are well known in the art. For example, the hybridoma technique originally developed by Kohier 
and Milstein (Kohier, 1 975) as well as other techniques such as the human B-cell hybridoma 
technique (Kozbor, 1983), the EBV-hybridoma technique to produce human monoclonal 
antibodies (Cole, 1985), and screening of combinatorial antibody libraries (Huse, 1989). 
Hybridoma cells can be screened immunochemicaliy for production of antibodies specifically 
35 reactive with the peptide, and monoclonal antibodies isolated. 

The term antibody as used herein is intended to include fragments thereof which 
are also specifically reactive with a protein having the biological activity of myostatin, or a peptide 
fragment thereof. Antibodies can be fragmented using conventional techniques and the 
fragments screened for utility in the same manner as described above for whole antibodies. For 
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example, F(ab , ) 2 fragments can be generated by treating antibody with pepsin. The resulting 
Ftab^ fragment can be treated to reduce disulfide bridges to produce Fab' fragments. 

It is also known in the art to make chimeric antibody molecules with human 
constant regions. See, for example, Morrison et a\. y Takeda et a/., Cabilly et a/., Boss et a/., 
5 Tanaguchi et a/., Teng et al. (Morrison, 1985; Takeda, 1985; Cabilly; Boss; Tanaguchi; Teng, 
1982), European Patent Publication 0173494, United Kingdom Patent GB 2177096B, PCT 
Publication WO92/06193 and EP 0239400. It is expected that such chimeric antibodies would be 
less immunogenic in a human subject than the corresponding non-chimeric antibody. 

Another method of generating specific antibodies, or antibody fragments, 
10 reactive against protein having the biological activity of a myostatin protein, or a peptide fragment 
thereof, is to screen expression libraries encoding immunoglobulin genes, or portions thereof, 
expressed in bacteria, with peptides produced from the nucleic acid molecules of the present 
invention. For example, complete Fab fragments, VH regions and FV regions can be expressed 
in bacteria using phage expression libraries. See for example Ward etal., Huse et ai; and 
15 McCafferty et a/. (Ward, 1989; Huse, 1989; McCafferty, 1990). Screening such libraries with, for 
example, a myostatin protein can identify immunoglobulin fragments reactive with myostatin. 
Alternatively, the SCID-hu mouse developed by Genpharm can be used to produce antibodies, 
or fragments thereof. 

The polyclonal, monoclonal or chimeric monoclonal antibodies can be used to 
20 detect the proteins of the invention, portions thereof or closely related isoforms in various 
biological materials, for example they can be used in an ELISA, radioimmunoassay or 
histochemical tests. Thus, the antibodies can be used to quantify the amount of a myostatin 
protein of the invention, portions thereof or closely related isoforms in a sample in order to 
determine the role of myostatin proteins in particular cellular events or pathological states. Using 
25 methods described hereinbefore, polyclonal, monoclonal antibodies, or chimeric monoclonal 
antibodies can be raised to nonconserved regions of myostatin and used to distinguish a 
particular myostatin from other proteins. 

The polyclonal or monoclonal antibodies can be coupled to a detectable 
substance or reporter system. The term "coupled" is used to mean that the detectable 
30 substance is physically linked to the antibody. Suitable detectable substances include various 
enzymes, prosthetic groups, fluorescent materials, luminescent materials and radioactive 
materials. Examples of suitable enzymes include horseradish peroxidase, alkaline phosphatase, 
p-galactosidase, and acetylcholinesterase; examples of suitable prosthetic group complexes 
include streptavidin/biotin and avidin/biotin; examples of suitable fluorescent materials include 
35 umbelliferone, fluorescein, fluorescein isothiocyanate, rhodamine, dichlorotriazinylamine 

fluorescein, dansyl chloride and phycoerythrin; an example of a luminescent material includes 
luminol; and examples of suitable radioactive material include 125 l; 13l l, 35 S and 3 H. in a preferred 
embodiment, the reporter system allows quantitation of the amount of protein (antigen) present. 
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Such an antibody-linked reporter system could be used in a method for 
determining whether a fluid or tissue sample of a subject contains a deficient amount or an 
excessive amount of the protein. Given a normal threshold concentration of such a protein for a 
given type of subject, test kits could thus be developed. 
5 The present invention allows the skilled artisan to prepare bispecific antibodies 

and tetrameric antibody complexes. Bispecific antibodies can be prepared by forming hybrid 
hybridomas (Staerz, 1986a &b). 

Compositions of the invention are administered to subjects in a biologically 
compatible form suitable for pharmaceutical administration in vivo. By "biologically compatible 

1 0 from suitable for administration in vivo" is meant a form of the composition to be administered in 
which any toxic effects are outweighed by the therapeutic effects of the composition. The term 
"subject" is intended to include living organisms in which a desired therapeutic response can be 
elicited, e.g. mammals. Examples of subjects include cattle, human, dogs, cats, mice, rats and 
transgenic species thereof. Administration of a therapeutically active amount of the therapeutic 

1 5 compositions of the present invention is defined as an amount effective, at dosages and for 
periods of time necessary to achieve the desired result. For example, a therapeutically active 
amount of a compound that inhibits the biological activity of myostatin protein may vary according 
to factors such as the age, sex, and weight of the individual, as well as target tissue and mode of 
delivery. Dosage regimes may be adjusted to provide the optimum therapeutic response. For 

20 example, several divided doses may be administered daily or the dose may be proportionally 
reduced as indicated by the exigencies of the therapeutic situation. 

As far as the United States is concerned, this application is a Continuation-in- 
Part Application of prior United States Patent Application Serial No. 08/891 ,789, filed July 14, 
1 997, the specification of which is incorporated herein by reference. 

25 Those skilled in the art will know, or be able to ascertain using no more 

than routine experimentation, many equivalents to the specific embodiments of the invention 
described herein. Such equivalents are intended to be encompassed by the following claims. 
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CLAIMS 

1 . A method of increasing muscle mass of a mammal having muscle cells in which myostatin is 
expressed, the method comprising administering to the mammal an effective amount of a nucleic 
acid molecule substantially complementary to at least a portion of mRNA encoding the myostatin 

5 and being of sufficient length to sufficiently reduce expression of the myostatin to increase the 
muscle mass. 

2. The method of claim 1 wherein the mammal is bovine. 

3. A method of increasing muscle mass of a mammal, the method comprising administering to 
the mammal an effective amount of a nucleic acid molecule having ribozyme activity and a 

10 nucleotide sequence substantially complementary to at least a portion of mRNA encoding 
myostatin and being of sufficient length to bind selectively thereto to sufficiently reduce 
expression of the myostatin so as to increase the muscle mass. 

4. The method of claim 3 wherein the mammal is bovine. 

5. A diagnostic kit, for determining the presence of muscular hyperplasia in a mammal from 
15 which a sample containing DNA of the mammal has been obtained, the kit comprising: 

first and second primers for amplifying the DNA, the primers being complementary to 
nucleotide sequences of the DNA upstream and down stream, respectively, of a 
mutation in the portion of the DNA encoding myostatin which results in muscular 
hyperplasia of the mammal, wherein at least one of the nucleotide sequences is 
20 selected to be from a non-coding region of the myostatin gene. 

6. The diagnostic kit of claim 5, further comprising a third primer complementary to a naturally 
occurring mutation of a coding portion of the myostatin gene. 

7. A diagnostic kit, for determining the genotype of a sample of mammalian genetic material, the 
kit comprising: 

25 a pair of primers for amplifying a portion of the genetic material corresponding to a 

nucleotide sequence which encodes at least a portion of a myostatin protein, 
wherein a first of the primers includes a nucleotide sequence sufficiently 
complementary to a mutation of SEQ ID NO:1 to prime amplification of a nucleic 
acid molecule containing the mutation, the mutation being selected from the group 

30 of mutations resulting from: (a) deletion of 11 nucleotides beginning at nucleotide 

821 of the coding portion of SEQ ID NO:1; (b) deletion of 7 nucleotides beginning at 
nucleotide 41 9 of the coding sequence and insertion of the sequence 
AAGCATACAA in place thereof; (c) deletion of nucleotide 204 of the coding 
sequence and insertion of T in place thereof; (d) deletion of nucleotide 226 of the 

35 coding sequence and insertion of T in place thereof; and (e) deletion of nucleotide 

313 of the coding sequence and insertion of A in place thereof; and combinations 
thereof. 
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8. The diagnostic kit of claim 7 wherein a second of the pair of primers is located entirely 
upstream or entirely downstream of the selected mutation or mutations. 

9. The diagnostic kit of claim 8 wherein a first said primer spans mutation (a) and further 
comprising a third primer which is sufficiently complementary to the nucleotide sequence 

5 identified as SEQ ID NO:1 1 to prime amplification of a nucleic acid molecule containing SEQ ID 
NO:11. 

10. The diagnostic kit of claim 8 wherein a first said primer is sufficiently complementary to the 
inserted sequence of mutation (b) to prime amplification of a nucleic acid molecule containing 
mutation (b) and further comprising a third primer which is sufficiently complementary to the 

1 0 sequence corresponding to the 7 nucleotide deletion of mutation (b) to prime amplification of a 
nucleic acid molecule containing the 7 nucleotide deletion of mutation (b). 

1 1 . The diagnostic kit of claim 8 wherein a first said primer spans mutation (c) and further 
comprising a third primer which is sufficiently complementary to the sequence spanning the 
corresponding region lacking mutation (c) to prime amplification of a nucleic acid molecule 

15 lacking mutation (c). 

12. The diagnostic kit of claim 8 wherein a first said primer spans mutation (d) and further 
comprising a third primer which is sufficiently complementary to the sequence spanning the 
corresponding region lacking mutation (d) to prime amplification of a nucleic acid molecule 
lacking mutation (d). 

20 1 3. The diagnostic kit of claim 8 wherein a first said primer spans mutation (e) and further 
comprising a third primer which is sufficiently complementary to the sequence spanning the 
corresponding region lacking mutation (e) to prime amplification of a nucleic acid molecule 
lacking mutation (e). 

14. A method for determining the presence of muscular hyperplasia in a bovine animal, the 
25 method comprising: 

obtaining a sample of material containing DNA from a said animal; and 
ascertaining whether DNA having a nucleotide sequence encoding a protein having 
biological activity of myostatin is present, 
wherein the absence of DNA having said nucleotide sequence indicates the presence of 
30 muscular hyperplasia in the animal. 

15. The method of claim 14 wherein ascertaining whether DNA having a nucleotide sequence 
encoding a protein having biological activity of myostatin includes amplifying the DNA in the 
presence of primers based on a nucleotide sequence encoding a protein having biological activity 
of myostatin. 

35 16. The method of claim 1 5 wherein DNA of a said bovine animal not displaying muscular 
hyperplasia has a nucleotide sequence which is capable of hybridizing with a nucleic acid 
molecule having the sequence identified as SEQ ID NO:1 under stringent hybridization 
conditions. 
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17. The method of claim 14, wherein ascertaining whether DNA having a nucleotide sequence 
encoding a protein having biological activity of myostatin is present includes amplifying the DNA 
in the presence of primers based on a nucleotide sequence encoding the N-terminal and the C- 
terminal, respectively, of the protein having biological activity of myostatin. 

5 18. The method of claim 14, wherein ascertaining whether DNA having a nucleotide sequence 
encoding a protein having biological activity of myostatin is present includes amplifying the DNA 
in the presence of first and second primers based on first and second nucleotide sequences 
encoding spaced apart regions of the protein, wherein said regions flank a mutation known to 
naturally occur and which when present in both alleles of a said animal results in said muscular 

10 hyperplasia. 

19. The method of claim 18 wherein a DNA of said animal not displaying muscular hyperplasia 
contains a nucleotide sequence which hybridizes under stringent conditions with a nucleotide 
sequence encoding a protein having a sequence identified as SEQ ID NO:2 and the coding 
sequence of DNA of a said animal displaying muscular hyperplasia is known to contain an 1 1- 

15 base pair deletion beginning at base pair no. 821 , and said first primer is selected to be upstream 
of the codon encoding glutamic acid no. 275 and the second primer is selected to be 
downstream of the codon encoding aspartic acid no. 274. 

20. The method of claim 14 wherein a DNA of said animal not displaying muscular hyperplasia 
contains a nucleotide sequence which hybridizes under stringent conditions with a nucleotide 

20 sequence encoding a protein having a sequence identified as SEQ ID NO:2 and the coding 
sequence of DNA of a said animal displaying muscular hyperplasia is known to contain an 1 1- 
base pair deletion beginning at base pair no. 821 , and said primer is selected to span the 
nucleotide sequence including base pair nos. 820 and 821 of the DNA sequence containing said 
deletion. 

25 21 . The method of claim 1 9 wherein the animal is of a breed selected from Belgian Blue, 
Asturiana, Parthenaise and Rubia Gallega. 

22. The method of claim 20 wherein the animal is a breed selected from Belgian Blue, 
Asturiana, Parthenaise and Rubia Gallega. 

23. The method of claim 14 wherein ascertaining whether DNA having a nucleotide sequence 
30 encoding a protein having biological activity of myostatin is present includes amplifying the DNA 

in the presence of a primer containing at least a portion of a mutation known to naturally occur 
and which when present in both alleles of a said animal results in said muscular hyperplasia. 

24. A method for determining the presence of muscular hyperplasia in a bovine animal, the 
method comprising: 

35 obtaining a sample of material containing DNA from a said animal; and 

ascertaining whether DNA having a mutation as defined in claim 7 is present; and 
ascertaining whether DNA having a nucleotide sequence encoding a protein having 
biological activity of myostatin is present, 
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wherein the absence of DNA having said nucleotide sequence and presence of a said mutation 
indicates the presence of muscular hyperplasia in the animal. 

25. A method for determining the presence of muscular hyperplasia in a bovine animal, the 
method comprising: 
5 obtaining a sample of the animal containing mRNA; and 

ascertaining whether an mRNA encoding a protein having biological activity of myostatin 
is present in the sample, 
wherein the absence of said mRNA indicates the presence of muscular hyperplasia in the 
animal. 

1 0 26. The method of claim 25 wherein the sample is of muscle tissue or wherein the tissue is 
skeletal muscle tissue. 

27. The method of claim 25 wherein ascertaining whether mRNA having a nucleotide sequence 
encoding a protein having biological activity of myostatin includes amplifying the mRNA in the 
presence of primers substantially complementary to the nucleotide sequence encoding the 

15 protein. 

28. The method of claim 27 wherein mRNA of a said bovine animal not displaying muscular 
hyperplasia has a nucleotide sequence which is capable of hybridizing with a nucleic acid 
molecule having the sequence identified as SEQ ID NO:1 under stringent hybridization 
conditions. 

20 29. The method of claim 25, wherein ascertaining whether mRNA having a nucleotide sequence 
encoding a protein having biological activity of myostatin is present includes amplifying the mRNA 
in the presence of primers substantially complementary to a nucleotide sequence encoding the 
N-terminal and the C-terminal, respectively, of the protein having biological activity of myostatin. 
30. The method of claim 25, wherein ascertaining whether mRNA having a nucleotide sequence 

25 encoding a protein having biological activity of myostatin is present includes amplifying the mRNA 
in the presence of first and second primers substantially complementary to first and second 
nucleotide sequences encoding spaced apart regions of the protein, wherein said regions flank a 
mutation known to naturally occur and which when present in both alleles of a said animal results 
in said muscular hyperplasia. 

30 31 . The method of claim 30 wherein an mRNA of said animal not displaying muscular 

hyperplasia contains a nucleotide sequence which hybridizes under stringent conditions with a 
nucleotide sequence encoding a protein having a sequence identified as SEQ ID NO:2 and the 
coding sequence of DNA of a said animal displaying muscular hyperplasia is known to contain an 
1 1 -base pair deletion beginning at base pair no. 821 , and said first primer is selected to be 

35 upstream of the codon encoding glutamic acid no. 275 and the second primer is selected to be 
downstream of the codon encoding aspartic acid no, 274. 

32. The method of claim 25 wherein ascertaining whether mRNA having a nucleotide sequence 
encoding a protein having biological activity of myostatin is present includes amplifying the mRNA 
in the presence of a primer containing a nucleotide sequence complementary to at least a 
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portion of a mutation known to naturally occur in a said animal and which when present in both 
alleles of a said animal results in said muscular hyperplasia. 

33. The method of claim 32 wherein an mRNA of said animal not displaying muscular 
hyperplasia contains a nucleotide sequence which hybridizes under stringent conditions with a 

5 nucleotide sequence encoding a protein having a sequence identified as SEQ ID NO:2 and the 
coding sequence of DNA of a said animal displaying muscular hyperplasia is known to contain an 
1 1-base pair deletion beginning at base pair no. 821 , and said primer is selected to span the 
deleted portion. 

34. The method of claim 31 wherein the animal is of a breed selected from Belgian Blue, 
1 0 Asturiana, Parthenaise and Rubia Gallega. 

35. A method for determining the presence of muscular hyperplasia in a mammal, the method 
comprising: 

obtaining a sample of material containing DNA from the mammal; and 
ascertaining whether a sequence of the DNA encoding (a) a protein having biological activity 
15 of myostatin, is present, and whether a sequence of the DNA encoding (b) an allelic 

protein lacking the activity of (a), is present; 
wherein the absence of (a) and the presence of (b) indicates the presence of muscular 

hyperplasia in the mammal. 

36. The method of claim 35 wherein (b) contains a naturally occurring mutation responsible for 
20 the lack of activity. 

37. The method of claim 35 wherein the mammal is a human. 

38. The method of claim 37 wherein ascertaining whether a sequence of the DNA encoding (a) 
is present and whether a sequence of the DNA encoding (b) is present includes amplifying the 
DNA 

25 * in the presence of primers based on a nucleotide sequence encoding a protein having biological 
activity of myostatin. 

39. The method of claim 38 wherein said primers are based on the sequence identified as SEQ 
ID NO:7. 

40. A method for determining the presence of muscular hyperplasia in a mammal, the method 
30 comprising: 

obtaining a sample of material containing mRNA from the mammal; and 

ascertaining whether a sequence of the mRNA encoding (a) a protein having biological 

activity of myostatin, is present, and whether a sequence of the mRNA encoding 

(b) a protein at least partially encoded by a truncated nucleotide sequence 
35 corresponding to substantially the sequence of the mRNA and lacking the 

activity of (a), is present; 
wherein the absence of (a) and the presence of (b) indicates the presence of muscular 

hyperplasia irrthe mammal. 
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41 . The method of claim 40 wherein the mRNA encoding (a) and the truncated sequence 
correspond to alleles of DNA of the mammal. 

42. The method of claim 40 wherein the mammal is human. 

43. The method of claim 42 wherein ascertaining whether a sequence of the mRNA encoding 

5 (a) is present, and whether a sequence of the mRNA encoding (b) is present includes amplifying 
the mRNA in the presence of a pair of primers complementary to a nucleotide sequence 
encoding a protein having biological activity of myostatin. 

44. The method of claim 43 wherein each said primer contains a truncated nucleotide sequence 
substantially complementary to a portion of the sequence identified as SEQ ID NO:7. 

1 0 45, The method of claim 44 wherein the truncated sequence contains at least 50 consecutive 
nucleotides substantially corresponding to about 10, or between about 10 and 20, or between 
about 20 and 30, or between about 30 and 40, or between about 40 and 50 consecutive 
nucleotides of SEQ ID NO;7. 

46. A method for determining the presence of muscular hyperplasia in a mammal, the method 
15 comprising: 

obtaining a tissue sample of containing mRNA of the mammal; and 
ascertaining whether an mRNA encoding a mutant type myostatin protein lacking 
biological activity of myostatin is present, 
wherein the presence of a said mRNA encoding a mutant type myostatin protein indicates the 
20 presence of muscular hyperplasia in the mammal. 

47. The method of claim 46 wherein the mutant type myostatin protein lacing biological activity is 
* encoded by a naturally occurring allele of DNA encoding the mRNA. 

48. A method for determining the presence of double muscling in a bovine animal, the method 
comprising: 

25 obtaining a sample of material containing DNA from the animal; and 

ascertaining whether the DNA contains the nucleotide coding sequence identified as 
SEQIDNO:11, 

wherein absence of the sequence indicates double muscling in the animal. 

49. The method of claim 34 wherein the animal is of a breed selected from Belgian Blue, 
30 Asturiana, Parthenaise and Rubia Gallega. 

50. A method for determining the myostatin genotype of a mammal, comprising: 

obtaining a sample of material containing nucleic acid of the mammal, wherein the nucleic 

acid is uncontaminated by heterologous nucleic acid; 
ascertaining whether the sample contains a (i) nucleic acid molecule encoding a protein 
35 having biological activity of myostatin; and 

ascertaining whether the sample contains an (ii) allelic nucleic acid molecule encoding a 
protein lacking biological activity of myostatin. 

51 . The method of claim 50 wherein the mammal is human and (i) comprises a nucleic acid 
- sequence substantially homologous with the sequence identified as SEQ ID NO:7. 
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52. A purified protein having biological activity of myostatin, and having an amino acid sequence 
identified as SEQ ID NO:2, or a conservatively substituted variant thereof. 

53. An isolated nucleic acid molecule encoding a protein of claim 52. 

54. An isolated nucleic acid molecule comprising a DNA molecule having the nucleotide 

5 sequence identified as SEQ ID NO:1 or which varies from the sequence due to the degeneracy 
of the genetic code, or a nucleic acid strand capable of hybridizing with at least one said nucleic 
acid molecule under stringent hybridization conditions. 

55. Isolated mRNA transcribed from DNA having a sequence which corresponds to a nucleic 
acid molecule according to claim 54. 

10 56. Isolated DNA having a sequence according to claim 54 in a recombinant cloning vector. 

57. A microbial cell containing and expressing heterologous DNA which is complementary a 
nucleic acid molecule of claim 54. 

58. A transfected cell line which expresses a protein of claim 52. 

59. A process for producing the protein of claim 52 comprising: 

15 preparing a DNA fragment including a nucleotide sequence which encodes said protein; 

incorporating the DNA fragment into an expression vector to obtain a recombinant DNA 
molecule which includes the DNA fragment and is capable of undergoing 
replication; 

transforming a host cell with said recombinant DNA molecule to produce a transformant 
20 which can express said protein; 

cuituring the transformant to produce said protein; and 
recovering said protein from resulting cultured mixture. 

60. A method of increasing muscle mass in a mammal, comprising administering an effective 
amount of an antibody to myostatin to said mammal. 

25 61. A method of increasing muscle mass in a mammal, comprising raising an autoantibody to 
the myostatin the in the mammal. 

62. The method of claim 61 wherein raising the autoantibody includes administering a protein 
having myostatin activity to the mammal. 

63. A method of increasing muscle mass in a mammal in need thereof, comprising 

30 administering to the mammal an effective amount of an antisense nucleic acid or oligonucleotide 
substantially complementary to at least a portion of the sequence identified as SEQ ID NO:1 or 
SEQ ID NO:5, or SEQ ID NO:7. 

64. The method of claim 63 wherein the portion is at least 5 nucleotide bases in length. 

65. The method of claim 64 wherein the mammal is a bovine and the sequence is the sequence 
35 identified as SEQ ID NO:1. 

66. A method of increasing muscle mass in a mammal, comprising administering to the 
mammal an effective amount of an antibody to the myostatin. 

67. A probe comprising a nucleic acid molecule sufficiently complementary with a sequence 
identified as SEQ ID NO:1 , or its complement, so as to bind thereto under stringent conditions. 
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68. The probe of claim 67 wherein the sequence is between about 8 and about 1 195 
nucleotides in length, or between about 1 5 and 1 1 95 nucleotides in length, or between about 25 
and 1195 nucleotides in length, or between about 35 and 1195 nucleotides in length, or between 
about 45 and 1 1 95 nucleotides in length, or between about 55 and 1 1 95 nucleotides in length, or 

5 between about 65 and 1 1 95 nucleotides in length, or between about 75 and 1 1 95 nucleotides in 
length, or between about 75 and 1 195 nucleotides in length, or between about 85 and 1 1 95 
nucleotides in length, or between about 95 and 1 195 nucleotides in length, or between about 105 
and 1 1 95 nucleotides in length, or between about 115 and 1 1 95 nucleotides in length. 

69. A method for identifying a nucleotide sequence of a mutant gene encoding a myostatin 
1 0 protein of a mammal displaying muscular hyperplasia, the method comprising: 

obtaining a sample of material containing DNA from the mammal; and 

probing the sample using a nucleic acid probe based on a nucleotide sequence of a 

known gene encoding myostatin in order to identify nucleotide sequence of the 

mutant gene. 

15 70. The method of claim 69, wherein the probe is based on a nucleotide sequence of a non- 
coding region of the gene. 

71 . The method of claim 70 wherein the probe is based on SEQ ID NO:54. 

72. The method of claim 71 wherein the probe is at least 8 nucleic acids in length. 

73. The method of claim 69, wherein the step of probing the sample includes exposing the DNA 
20 to the probe under hybridizing conditions and further comprising isolating hybridized nucleic acid 

molecules. 

74. The method of claim 73, further comprising the step of sequencing isolated DNA. 

75. The method of claim 69, wherein the mammal is a bovine mammal and the probe is based 
on a said nucleotide sequence identified as SEQ ID NO:1 . 

25 76. The method of claim 74, further comprising the step of isolating and sequencing a cDNA or 
mRNA encoding the complete mutant myostatin protein. 

77. The method of claim 71, further comprising the step of isolating and sequencing a functional 
wild type myostatin from a said mammal not displaying muscular hyperplasia. 

78. The method of claim 76, further comprising comparing the complete coding sequence of the 
30 complete mutant myostatin protein with, if the coding sequence for a functional wild type 

myostatin from a said mammal is previously known, (1) the known sequence, or if the coding 
sequence for a functional wild type myostatin from a said mammal is previously unknown, (2) the 
sequence determined according to claim 74 or claim 77, to determine the location of any 
mutation in the mutant gene. 
35 79. A method for determining the myostatin genotype of a mammal, wherein wild type myostatin 
of the mammal is substantially that of claim 78, comprising: 

obtaining a sample of material containing DNA from the mammal; and 
ascertaining whether the DNA contains a said mutation determined according to claim 
78. 
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80. A method for determining the myostatin genotype of a mammal, wherein wild type myostatin 
of the mamma! is substantially that of claim 78, comprising: 

obtaining a sample of material containing mRNA from the mammal; and 
ascertaining whether the mRNA contains a said mutation determined according to claim 
5 78. 

81. A primer composition useful for the detection of a nucleotide sequence encoding a myostatin 
comprising a first nucleic acid molecule based on a nucleotide sequence located upstream of a 
said mutation determined according to claim 78 and a second nucleic acid molecule based on a 
nucleotide sequence located downstream of the mutation. 

10 82. A probe comprising a nucleic acid molecule based on a nucleotide sequence of claim 74 or 
claim 76 and spanning a said mutation determined according to claim 78. 
83. A transgenic mammal having a phenotype characterized by muscular hyperplasia, said 
phenotype being conferred by a transgene contained in the somatic and germ cells of the 
mammal, the transgene encoding a myostatin protein having a dominant negative mutation. 

15 84. The transgenic mammal of claim 83 wherein the mammal is male and non-human and the 
transgene is located on the Y chromosome. 

85. The transgenic mammal of claim 83 wherein the mammal is bovine and the transgene is 
located to be under the control of a promoter which normally a promoter of a myosin gene. 

86. A transgenic mammal having a phenotype characterized by muscular hyperplasia, said 
20 phenotype being conferred by a transgene having a sequence antisense to that encoding a 

myostatin protein of the mammal. 

87. The transgenic mamma! of claim 86 wherein the mamma! is bovine and the transgene is 
located on the Y chromosome. 

88. The transgenic mammal of claim 86 wherein the transgene further comprises a sequence 
25 which when transcribed obtains an mRNA having ribozyme activity. 

89. A transgenic non-human mammal having a phenotype characterized by muscular 
hyperplasia, said phenotype being inducible and being conferred by a myostatin gene flanked by 
J oxP sides and a Cre transgene under the dependence of an inducible promoter. 

90. A transgenic non-human male mammal having a phenotype characterized by muscular 
30 hyperplasia , said phenotype being conferred by a myostatin gene flanked by J oxP sides and a 

Cre transgene located on the Y chromosome. 

91 . A method for determining whether a sample of mammalian genetic material is capable of a 
conferring a phenotype characterized by muscular hyperplasia, comprising ascertaining whether 
the genetic materia! contains a nucleotide sequence encoding a protein having biological activity 

35 of myostatin, wherein the absence of said sequence indicates the presence of muscular 
hyperplasia in the animal. 

92. A transgenic bovine having a genome lacking a gene encoding a protein having biological 
activity of myostatin. 
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93. A transgenic mouse having a genome containing a gene encoding a human protein having 
biological activity of myostatin or containing a gene encoding a bovine protein having biological 
activity of myostatin. 

94. A transgenic bovine having a gene encoding a bovine protein having biological activity of 
5 myostatin and heterologous nucleotide sequence antisense to the gene. 

95. A transgenic bovine of claim 94, further comprising a gene encoding a nucleic acid 
sequence having ribozyme activity and in transcriptional association with the nucleotide sequence 
antisense to the gene. 
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SEQUENCE ID NO. 1 



PCT7IB98/01197 



1 AGGAAGAATA AGAACAAGGG AAAAGATTCT ATTGATTTTA AAACCATGCA 
51 AAAACTGCAA ATCTCTGTTT ATATTTACCT ATTTATGCTG ATTGTTGCTG 
101 GCCCAGTGGA TCTGAATGAG AACAGCGAGC AGAAGGAAAA TGTGCAAAAA 
151 GAGGGGCTCT GTAATGCATG TTTGTGGAGG (SAAAACACTA CATCd'CAAG 
201 ACTAGAAGCC ATAAAAATCC AAATCCTCAG TAAAClTCGC CTGGAAACAG 
2 51 CTCCTAACAT CAGCAAAGAT GCTATCAGAC AACTTTTGCC CAAGGCTCCT 
301 CCJLCTCCTGG AACTGATTGA TCAGTTCGAT GTCCAGAGAG ATCCCAGCAG 
351 TGACGGCTCC TTGGAAGACG ATGACTACCA CGCCAGGACG GAAACCGTCA 
401 TTACCATGCC CACGGAGTCT GATCTTCTAA CGCAAGTGGA AGCAAAACCC 
451 AAATGTTGCT TCTTTAAATT TAGCTCTAAG ATACAATACA ATAAACTAGT 
501 AAAGGCCCAA CTGTGGATAT ATCTGAGGCC TGTCAAGACT CCTGCGACAG 
551 TGTTTG TGCA AATCCTGACA CTCATCAAAC CCATGAAAGA CGGTACAAGG 
601 TATACTGGAA TCCGATCTCT GAAACTTGAC ATGAACCCAG GCACTGGTAT 
651 TTGCCAGAGC ATTGATGTGA AGACACTGTT GCAGAACTGG CTCAAACAAC 
701 CTGAATCCAA CTTAGGCATT GAAATCAAAG CVITAGATGA GAATGGCCAT 
751 GATCTTGCTG TAACCTTCCC AGAACCAGGA GAAGATGGAC TGACTCCTTT ' 
801 TTTAGAAGTC AAGGTAACAG ACACACCAAA AAGATCTAGG AGAGA1TTTG 
851 GGCTTGATTG T GATGAAC AC TCC AO AG AAT CTCGATGCTG TCUTTACCCT 
901 CTAACTGTGG ATTTTGAAGC TTTTGGATGG GATTGGATTA TTGCACCTAA 
95] AAGATATAAG GCCAATTACT GCTCTGGAGA ATGTGAA'ITT GTATTTTTGC 
1001 AAAAGTATCC T CAT AC C CAT CTTGTGCACC AAGCAAACCC CAGAGCTTCA 
1051 CCCGGCCCCT GCTGTACTCC TACAAAGATG TCTCC AATTA ATATGCTATA 
1101 TTTTAATGGC CAAGGACAAA TAATATACGG GAAGATTCCA GCCA'!*CCJTAG 
1151 TAGATCGCTG T GGGTGTTC A TGAGTCTATA TTtGGgTTCA TAAGC 
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SEQUENCE ID NO. 2 



1 


JTMQKLQISVY 


IYLFMLIVAG 


PVDLNENSEQ 


KRNVEKEULC 


NACLWRENTT 


51 


SSRLEATKIQ 


ILSKLRLETA 


PNIKKDAIRQ 


LLPKAPPLLE 


L1JJQPDVQRD 


101 


ASSDGSLEDD 


DYIIARTETVI 


TMPTESDLLT 


UVEGKPKCCP 


FEFSSKIQYN 


151 


KT.VKAQLWIY 


L.R PVKTPATV 


FVQILRI.TKP 


MK1XJTRYTGI 


KSLXLEMNPG 


201 


TGTWQSIDVK 


TVLQNWLKQP 


ESNLGIETTCA 


LiDKNGHDTjAV 


TPPEPGEDGL 


251 


T P FL.EVKVTD 


TPKRSRRDFC 


LDCDEHSTES 


RCCP.YPLTVn 


FEAfCWDWTI 


301 


APKRTKANYC 


SCECEFVFLQ 


KYPHTHLiVHQ 


ANPRGSAGPC 


CTPTKMSPIN 


351 


MLYFNGECQI 


TYGKIPAMW 
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SEQUENCE ID NO- 3 

1 AGGAAGAATA AGAACAAGGG AAAAGATTGT ATTGAT1 "ITA AAACCATGCA 

51 AAAACTCCAA ATCTCTGTTT ATATTTACCT ATTTATGC* I 'U ATTGTTGCTG 

101 GCCCAGTCCA TCTGAATGAG AACAGCGAGC AGAAGGAAAA MXJTGGAAAAA 

15? CAGGGGCTGT GTAATGCATG TTTGTGGAGG GAAAACACTA CATCCTCAAG 

201 ACTACAAGCC ATAAAAATCC AAATCCTCAG TAAACTTCGC CTGGAAACAG 

251 CTCCTAACAT CAGCAAAGAT GCTATCAGAC AACTTTTCJCC CAAGGCTCCT 

3 01 CCACTCCTGG AACTGATTGA TCAGTTCGAT GTCCAGAGAC ATGCCAGCAG 
351 TGACGGCTCC TTGGAAGACG ATGACT AC C A CGCCAGGACG GAAACGGTCA 

4 01 TTACCATGCC CACGGAGTCT GATCTTCTAA CGCAAGTGGA AGOAAAACCC 
451 AAATGTTGCT TCTTTAAATT TAGCTCTAAG ATACAATACA ATAAACTAGT 
501 AAAGGCCCAA CTGTGGATAT ATCTGAGGCC TGTCAAGACT CCTGCGACAG 
551 TGTTTGTGCA AATCCTGACA CTCATCAAAC CCATGAAAGA CGGTACAAGC 
601 TATACTGGAA TCCGATCTCT GAAACTTGAC ATGAACCCAG GCACTGGTAT 
651 TTGGCAGAGC ATTGATGTGA AGACAGTGTT GCAGAACTGG CTCAAACAAC 
701 CTGAATCCAA CTTACGCATT GAAATCAAAG CTTTAGATGA GAATGGC CAT 
751 GATCTTGCTG TAACCTTCCC AGAACCAGGA GAAGATGGAC TGACTCCTTT 
801 TTTAGAAGTC AAGGTAACAG ACACACCAAA AAGATCTAGG AGAGATTTTG 
851 GGCTTGATTG TGACAGAATC TCGATGCTGT CGTTACCCTC TAACTGTGGA 

9 01 TTTTGAAGCT TTTGGATGGG ATTGGATTAT TCCACCTAAA AGATATAAGG 

951 CCAATTACTG CTCTGGAGAA TGTGAATTTG TATTTTTGCA AAAGTATCCT 

1001 CATACCCATC TTGTGCACCA AGCAAACCCC AGAGGTTCAG CCGGCCCCTC 

1051 CTGTACTCCT AC AAAGAT GT CTCCAATTAA TATGCTATAT TTTAATGGCG 

1101 AAGGACAAAT AATATACGGG AACATTCCAG CCATGGTAGT AAATCGCTGT 

1151 GGGTGTTCAT GAGGTCTATA TTTGGTTCAT AGCTTCCTGA AACATGGAAG 

1201 GTCTTCCCCT CAACAATTTT GAAACTGTTG AAATTATGT 
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SEQUENCE ID NO. 4 

IYLFMLIVAG PVDLNENSEQ KENVEKEGLrC NACLWK KNTT 

51 SSRLtEAIKTQ ILSKLRLETA PNISKDAIRQ I.LPKAPPLLE LIL^/FDVQRD 

101 ASSDGSLEDD DTHARTETVJ TMPTESDLLT QVEGKPKCCF PK KSSKIQYN 

151 KLVKAQLV7IY TiR PVKTPATV FVQIIjRLXKP MKDGTRYTGI RST.K3..13MNPG 

201 TGIWQSIDVK TVLQNWLKQP ESNIiGTEIKA LDENGHDLAV TFPEPG£lX5L< 

251 TPPLHVKVTD TPKR.SRROFG TjDCDRISMLS LPSNCGF 
301 
351 
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SEQUENCE ID NO. 5 



PCT/IB98/01197 



1 GTCTCTCGGA CCGTACATGC ACTAATAXTT CACTTGGCAT TACTCAAAAG 

SI CAAAAAGAAG AAATAAGAAC AACCGAAAAA AAAAGATTGT GCTGATTTTT 

3 01 AAAATGATGC AAAAACTGCA AATGTATGTT TATATTTACC TCTTCATGCT 

151 GATTGCTGCT GGCCCAGTGG ATCTAAATGA GGGCAGTGAG AGAGAAGAAA 

201 ATGTGGAAAA AGAGGGGCTC TGTAATGCAT GTGCGTGGAG ACAAAACACG 

251 AGGTACTCCA GAATAGAACC CATAAAAATT CAAATCCTCA GTAAGCTGCG 

3 01 CCTGGAAACA GCTCCTAACA TCAGCAAAGA TGCTATAAGA CAACTTCTGC 

.151 CAAGACCGCC TCCACTCCCG GAACTOATCC AT C ACT AC GA CGTCCAOAGG 

401 GATGACACCA GTGATGGCTC TTTGGAAGAT GACGATTATC ACGCTACCAC 

451 GGAAACAATC ATTACCATGC CTACAGAGTC TGACTTTCTA ATGCAACJCGG 

501 ATGGCAAGCC CAAATGTTGC TTTTTTAAAT TTAGCTCTAA AATACAGTAC 

551 AACAAAGTAG TAAAAGCCCA ACTGTGGATA TATCTCAGAC CCGTCAAGAC 

601 TCCTACAACA GTGTTTGTGC AAATCCTOAG ACTCATCAAA CCCATGAAAG 

651 ACCCTACAAG GTATACTGGA ATCC GATCTC TOAAACTTGA CATGAGCCCA 

701 GGCACTGGTA TTTGGCAGAG TATTGATGTG AAGACAGTGT TGCAAAATTG 

751 GCTCAAACAG CCTGAATCCA ACTTAGGCAT TGAAATCAAA GCTTTGGATG 

801 AGAATGGCCA TGATCTTCCT GTAACCTTCC CAGGACCAGG AGAAGATGGG 

851 CTGAATCCCT TTTTAGAACT CAAGGTGACA GACACACCCA AGAGGTCCCG 

901 GAG AG AC TTT GGGCTTGACT GCGATGAGCA CTCCACGGAA TCCCGGTGCT 

951 GCCGCTACCC CCTCACGGTC GATTTTGAAG GCTTTGGATG GGACTGGATT 

1001 ATCGCACCCA AAAGATATAA GGCCAATTAC TGCTCAGGAG AGTGTGAATT 

1051 TGTGTTTTTA CAAAAATATC C GCAT ACTCA TCTTGTGCAC CAAGCAAACC 

1101 CCAGAGGCTC AGCAGGCCCT TGCTGCACTC CGACAAAAAT GTCTCCCATT 

1151 / -aTATGCTAT ATTTTAATGG CAAAGAACAA ATAATATATG CiCJAAAATTCC 

1201 AGCCATGGTA GT AG AC C GC T CTCCGTGCTC ATGAGCTTTG CATTAGGTTA 

1251 GAAACTTCCC AACTCATGGA AGGTCTTCCC CTCAATTTCG AAACTGTGAA 
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1301 


TTCAAGOACC 


ACAGGCTGTA 


GGCCTTGAGT 


AT GCTCT ACT 


AACGTAAGCA 


1351 


CAAGCTACAG 


TGTATGAACT 


AAAAGAGAGA 


ATAGATGCAA 


TGGTTGGCAT 


1401 


TCAACCACCA 


AAATAAACCA 


TACTATAGGA 


TGTTGTATGA TTTCCAGAGT 


1451 


TTTTGAAATA 


GATGGAGATC 


AAATTACATT 


TATCTCCATA 


TATOTATATT 


1501 


ACAACTACAA 


TCTACCCAAG 


GAAGTGAGAG 


CACATCTTGT 


GGTCTGCTGA 


1551 


GTTAQGAQGG 


TATGATTAAA 


AGGTAAAGTC 


TTATTTCCTA 


ACAGTTTCAC 


1601 


TTAATATTTA 


CAGAACAATC 


TATATGTAGC 


CTTTGTAAAG 


TGTAGGATTG 


1651 


TTATCATTTA 


AAAACATCAT 


GTACACTTAT 


ATTTGTATTG 


TATACl^CT 


1701 


AAGATAAAAT 


TCCACAAAGT 


AGGAATGGGG 


CCTCACATAC 


ACATTGCCAT 


1751 


TCCTATTATA ATTGGACAAT 


CCACCACGGT 


GCTAATGCAG 


TGCTCAATGG 


1801 


CTCCTACTGG 


ACCTCTCGAT 


AGAACACTCT 


ACAAAGTACG 


AGTCTCTCTC 


1851 


TCCCTTCCAG 


GTGCATCTCC 


ACACACACAG 


C ACT AAGTGT 


T CAATGCATT 


1 301 


TTCTTTAAG G 


AAAGAAGAAT 


CTTTTTTTCT 


AGAGGTCAAC 


TTTCAGTCAA 


1951 


CTCTAGCACA 


GCGGGAGTGA 


CTGCTGCATC 


XTAAAAGGCA 


GCCAAACAGT 


2001 


ATTCATTTTT 


TAATCTAAAT 


TTCAAAATCA 


CTGTCTGCCT 


TTATCACATG 


2051 


. CCAATTTTGT 


GGTAAAATAA 


TGGAAATGAC 


TGGTTCTATC 


AATATTGTAT 


2101' 


AAAAGACTCT 


CAAACAATTA 


CATTTATATA 


ATATGTATAC 


AATATTGTTT 


2151 


TGTAAATAAG 


TGTCTCCTTT 


TATATTTACT 


TTGGTATATT 


TTTACACTAA 


2201 


TGAAATTTCA 


AATCATTAAA 


GTACAAAGAC 


ATGTCATGTA 


TCACAAAAAA 


2251 


GGTGACTGCT 


TCTATTTCAG 


AGTGAATTAG 


CAGATTCAAT 


AGTGGTCTTA 


2301 


AAACTCTGTA 


TGTTAAGATT 


AGAAGGTTAT 


ATTACAATCA 


ATTTATGTAT 


2351 


TTTTTACATT 


ATCAACTTAT 


GGTTTCATGG 


TGGCTGTATC 


TATGAATGTG 


2401 


GCTCCCAGTC 


AAATTTCAAT 


GCCCCACCAT 


TTTAAAAATT 


ACAAGCATTA 


2451 


CTAAACATAC 


CAACATGTAT 


CTAAAGAAAT 


ACAAATATGG 


T ATCTCAAT A 


2501 


ACAGCTACTT 


TTTTATTTTA 


TAATTTGACA 


ATGAATACAT 


TTCTTTTATT 


2551 


TACTTCAGTT 


TTATAAATTG 


GAACTTTGTT 


TATCAAATGT 


ATTGTACTCA 


2601 


TAGCTAAATG 


AAATTATTTC 


TTACATAAAA 


ATGTGTAGAA 


ACTATAAATT 


2651 


AAAGTGTTTT 


CACATTTTTC 


AAACCC 
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SEQUENCE ID NO. 6 



1 MMQKLQ&nrVY I YLFML IAAG PVDLNEGSER EENVEKEGLC NACAWRQNTR 

51 YSRIEAIKIQ ILSKLlRLETA PNISKDAIRQ LLPRAPPLiRE LIDQYI7VQRD 

101 DSSDGST.EDD DYHATTETI I TMPTESDFLM QADGKPKCCF FKFSSKIQYN 

151 KWKAQLWIY LRPVKTPTTV rVQILRliXKP MKDGTRYTGT RST.KT.77MSPG 

201 TGIWQSIDVK TVLQNWLKQP ESNLGIEIKA LDENGHDLAV TFPGPGEDGL 

251 NPFLEVKVTE TPKRSRRDFG LDCDEHSTES RCCRYPIjTVD FEAFGWDWII 

3 01 APKRYXANYC SGECEFVTLiQ KYPHTHLVHQ ANPRGSAGPC CTPTKMSPIN 

351 MLYFNGKEQI IYGKIPAMW DRCGCS* 
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SEQUENCE ID NO. 7 



PCT/IB98/01197 



ha3753 WGATTTTCTA ATOC AACTOGATCGAAAACCC 



he 3 7 5 3 AAATCTTGCTTCTTTAAATTTAOCTCTAAAATACAATACAATAAACTACTAAAGGCCCAA 



bo 37 S 3 CTATCGATATATTTGAGACCCGTCGAQACTCCTACAACAQTOTTTGTGCAAATCCTGAGA 



hc37 5 3 CTCATCAJU^CCTATGAAAQAjC<KrrACAACCTTATACTGGAATCCGATCTCTGAAACTTQAC 
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b«375a ATCAXCCCJU2GC&CTX3QTATTTGCXIAAA • HXnt^Ttrr<UUlGA£ACTQTTGGAAAATTClG 



fa*3 7 S .1 (WTCTTQCT(7rAACCTTCCCACCAGClA GGAAQAJICSXTCXJGCTfiA ATCC C T' I > T l > l w rAAOAA 



h«3 7 5 3 C^^CAAGOTAAGAGACACACX^AAAAOATTCCACMAGrGGATTTTGCCT^ 



h«37 5 3 TaAGCACTCAACAaAATCACCJVTCCTCTCOTTAC^ 



ho 3 7 53 TTCCGATCCOATTOOA-TATCO 



ho783 3 AQcaATOcrrAGrrAOAccacTaTCKKJTacrrr- 



bs7«23 ATOAGATTTATATTAAGCCTTCATAACT 
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h*782 3 TTTTCAACK^NrrCAAATTAAOTACCA^^ 



fa«7823 jUlQCATAAOCTA^CTATCTAAACTAAAAGCKK^^ 



h*7 02 J TTTAACCATCCAAACAAATCATAC - • CAGAAAGTTTTATGATTTCCA11AGTTTTTTKAQO 



h*7823 CNAOAAAOOAOGAflTCAAANTTTCANTCTT ATG QT 



1152027 



ATTTCGCX^CAGGTNAAACACTTGAATTTATATTGTATGGTAaTATA 



h9 2 02 7 CTT GGT AAGAT AAAATTC CA C AAAAAT A CK3 OATG Q T (3 CA GCAT AT Q CA - ATTTCCATTCC 



1*92027 



TATTATAATTGACACACTACATTAACAATCCATOCCAACGGTGCTAATACGATAGGCTGA 
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&9S327 TAAATCTC*ACOTTCC\TTATTTTAATACTTaCWAAACATTACT\AOTATACCAAAATA 



ATTGACTCTATTATC - TO- AAATOAAQ * AATAAACT0A7OCZATC7CAACAATAACTOTT 



1335327 A Cl'L'1'1 ' A'l'l**!*!' A TAATTTQA T AATOAAT AT ATTTCTQC1A TTTA TTT A LTl'^XTT UTTTTOTA 



a5 5 32 7 AATTGGCATTTTGTTAATCAAATTTATTGTACT - ATGACTAAATGAAATTATTTCTTACA 



nS5327 T - CTAA TTT OTAGAAACA^TATAA OTTAT AITAAAGTOTTTTCACA' l " I " l * l " l '' i " l t QAAAQA C 



GMCnnri rv >-*A/r-\ nnrnee? a 1 i _ 
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SEQUENCE ID NO. 8 



1 


MQKLQLCVYI 


YLFMLIVAGP 


VDLNENSEQK 


ENVEKEGLCN ACTWRQNTKS 


51 


SRIEAIKIQI 


LSKLRLETAP 


NISKDVIRQL 


LPKAPPLREL 


IDQYDVQRDD 


101 


SSDGSLEDDD 


YHATTETIIT 


MPTESDFLMQ 


VDGKPKCCFF 


KFSSKIQYNK 


151 


WKAQLWIYL 


RPVETPTTVF 


VQILRLIKPM 


KEGTRYTGIR 


SLKLDMNPGT 


201 


GIWQSIDVKT 


VLQNWLKQPE 


SNLGIEIKAL 


DENGHDLAVT 


FPGPGEDGLN 


251 


PFLEVKVTDT 


PKRSRRDFGL 


DCDEHSTESR 


CCRYPLTVDF 


EAFGWDWIIA 


301 


PKRYKANYCS 


GECEFVFLQK 


YPHTHLVHQA 


NPRGSAGPCC 


TPTKMSPINM 


351 


LYFNGKEQII 


YGKIPAMWD 


RCGCS* 
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SEQUENCE ID NO. 54 

1 GCGGCCGCCC GGGCAGGTAT CGAAAGTTTC ACATATAAAG AT GAAT AAGA 

51 TCTAAGTGTA TATGTTATTG TTAATAAAGT TTTTAATTTT TCGAATGTCA 

101 CATACAGCCT TTATTATTCA TAGATTTATT CCTTTTAAGA AG TAGT CAAA 

151 TGAATCAGCT CACCCTTGAC TGTAACAAAA TACTGTTTGG TGACTTGTGA 

201 C AG AC AG G G T TTTAACCTCT GACAGC GAGA TTCATTGTGG AGCAAGAGC C 

251 AATCACAGAT CCCGACGACA CTTGTCTCATCAAAGTTGGA ATATAAAAAG 

301 CCACTTGGAA T ACAG T AT AAAAGA.T T C AC T GGTGTGGCAA GTTGTCTCTA 

351 GACTGGGCAG GCATTAACGT TTGGCTTGGC GTTACTCAAA AGCAAAAGAA 

4 01 AAGTAAAAGG AAGAAGTAAG AACAAGGGAA AAGATTGTAT TGATTTTAAA 

4 51 AC C ATG C AAA AACTGCAAAT CTCTGTTTAT ATTTACCTAT TTATGCTGAT 

501 TGTTGCTGGC CCAGTGGATC T GAAT G AG AA, CAGCGAGCAG AAGGAAAATG 

551 TGGAAAAAGA GGGGCTGTGT AATGCATGTT TGTGGAGGGA AAAC AC T AC A 

601 TCCTCAAGAC TAGAAGC C AT AAAAAT C C AA ATCCTCAGTA AACTTCGCCT 
651 ' GGAAACAGCT CCTAACATCA GCAAAGATGC TAT C AG AC AA CTTTTGCCCA 

701 AGGCTCCTCC ACTCCTGGAA C T GAT T GAT C AGTTCGATGT CCAGAGAGAT 

751 GCCAGCAGTG ACGGCTCCTT GGAAGACGAT G AC T AC CAC G CCAGGACGGA 

8 01 AACGGTCATT ACCATGCCCA CGGAGT/GTGA GTAGTCCTGCTGGTGCAAAG 

851 CAACGACTCT GCTGACTGCT GTTCTAGTGT T CAT G AAAAA CCGATCTATT 

901 TTCAGGCTCT TTTAACAAGC TGCTGGCTTG TATGTAAGGA GGAGGGGAAA 

951 GAGCTTTTTT CAAGATTTCA TGAGAAATAG ACCAATGAGA CTGAAAGCTG 

1001 CTACTTTATT TGTTTCCTTA GAGAGC T AAA. AAGCT AAAAA T C AAAAAT G A 

1051 AATGCTTGCA TAGCATTCAT GTTATATAGT T TAG TAT G AC AACTATAACA 

1101 TGTTTATGTT TTCACAGCTT AATGCTACCA AGGTAAAGGA TTGGGAAACA 

1151 GTATCAGCAA T G T G AAAAAT T T AC AT CAAA TTTCCTAATT GCATTTGGTT 
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1201" GCCTGAAATA TGCATTTATA ATAACAGGTT TTTTTTTTTT CAT T AAT AAA 
1251 AG AGAAAG G A AGAAATCTGT AGAGGTTGAA GCCTATCTGG GCATTTGCTG 
1301 AACACTTAGA ATGACTTCTG TTATT CAAAA CTATTTCTCA TAGGGTTTTT 
1351 ATGGTCTTCA CAGAGTATCT AATTTTGAAA GCTATTAGAG TGGAAAGGAT 
14 01 AAAAGAATAT T C T T AAT AAA CTTAATGTAT TAG T AAG AG C AATAAGGAAG 

14 51 TAAACACAGC ATAG TGAAAA ATCATGAGCT AATCAGCAGA AAATTCTAAG 
1501 AAATAAACAT TTTAATTACA AAGTTCCACT TATACCCTGA CCATGGTACT 

15 51 ATTGTTGAGA GTACCTTGTC TGCACATATC TAGGAGGCAC AT G C T T AAT A 
1601 ACCTTCTAAA ATATTATTGT ATTCCTCATA GGAGGGAGAA CTATTACCTA 
1651 TATGTAGTAC CTATGTTGTT TCTGAAAGAT AATATGTTTC ATGTATTTCT . 
1701 GTTGCAGTCA CTTCAAACCT ATACTCAAGG AAAGGGAGAC AGGCATCTCA 
1751 ACAGAGAAGG CATGACCAGA AAGAGTTTTG TGCCATGTGT CTGCGATCTT 
18 01 GCTTTATACA GGGCTCTACC CACTTTAAAC TGGACTCAAA ACAGTTTCAA 
18 51 AATACTGCTT TTTCTTATTA AGTAACTAGT TTATAAGGCA ACAAATAAAT 
1901 TTCCTTTAAG ACTGTGCTAT C AG AT AAT C C T GGAAT AGAT TTGCCTTACT 
1951 TATAAACAAT CTTGAGAAAA CAAAAAGGCA AGAAATTGCT AAGTGCTTCT 
2 001 GCTTACAATG ACAGCCTGGC C C T AAAGAC A ATGTTTTCTA AGTTTTGAAA 
2051 CAGCTTGAAT . ACAACATCTA AGTTTTGGTG CTAATTACCT GCTAGTTTTT 
2101 TTATTTTTTT CCTTTAAAAG GCTGTCCCAG CGTCCTAACA TAACAGATGC 
2151 ACTATATTTT CTGCTAATTC CCGAGGCTCA GTTAGTTGCT CACTGTGTCT 
2201 TGTCCCCAGG TAATTCAGGC CTGGGGGAAG GGTTCCTTCC TCCAGACTGA 
22 51 TTGGTACAGC TGCTCAGTAA GTGTAACTAC TCAGATTCCC AAAGAAT T C T 
2 301 AAGTGGATGT TCTTCCACAG TGTCTCTTGT TCTCTCTAAT CATCATCATT 
2351 T T AAAAT T T C ATCCACTTTT CATTCCTTAA TAGAATTTTC CTTAGTCCAC 
2 401 AGTTCTCTGG AAAG G AAG T A GGCTTCTCAT AACAGCTGAA AAAACATATA 
2 4 51 C C T AAAAGAT T C T G AAAAGC TGTAATAACT GTTATACTTG ATATTTTGCT 
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3751 ATAAAGTCAA 

3801 AACCATTTTA 

3851 ATCTATGCTT 

3901 TACCTTTTAT 

3951 GCAAGCTTTA 

4 001 AAATGATATC 

4 051 TTAAGTCATT 

4101 AC AT AAAAT G 

4151 AT AC C AATAA 

4201 ATGTGTTCTT 

4251 ACAGAGGTCT 

4 301 AGACTTTTCC 

'4351 TACAGGAGGA 

4 401 TTTTTAGTGA 

4451 TGCCTCTCTC 

4 501 TCTTCATCCC 

4 551 TGGCA^CTAT 

4 601 CAAGTGAATG 

4 651 ACATATTTAG 

4701 CTTGTCCTTC 

4751 CCTTTACTGT 

4 801 ATTCAGATAC 

4 8 51 ACCACAGGGG 

4 901 GAATCAAGCC 

4 951 AGAAATGTGA 
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ACAGAAAATA ATGCCTTATA TAT T AT AAAA ATTAATAAAA 
AAAT C TAG T A TAAGTTTAGA GCTACTCACT CTTCTGGCTT 
GTATTTACTT CTGTTTTCAA AAAATTTTTT AATGTGACCA 
TTCCAGTTAT TGATATAATT TACAACAAAA GATTATACTT 
TAGTTTTTAA ATGGTCTTAT TTGTAGTGAA TATCATATCT 
TAAATGTAAA G T AAAT CAT A CCTAAATGAA AACATATTCT 
ATAAAATTTT CCAGGTGATC AATTTTTCTT TAAAT AT AC T 
TTATTGACTC CC AAAAT GAT GTTATTTTGT ATAATCTTAA 
TTACCAGGTC TATTTTGGTT- TTAGTGTAGG ATAAAAAAGA 
TTTTCTAGGT AG CAT T T T AA TGATCAAAGT TGGTGACGTG 
TAAGTATTAT TAAACAGATG ATTAATAAGA TGTATTCCTC 
ATATAAAAGG AAAAATGTCT CAAATTCATG AAAAGATTGG 
GGATTAGCAA ATTGTAGTTT AAATATCTGA AT G G AAAC AC 
AAGAATAAAG GGAATATCAT TGTATCTTCT TCTGAGTCTG 
TCTTGGAGTT AGTCTTTCCA ACCCTATATA C T T AC C AC T A 
TCTACCTTCC TTTTTCCCAT TACATCTGTG' CAGTACTGGG 
TGTGTTTCGG TGTTAATATC CAAGTTTCCC TGAATAAGAC 
GAGGATGAAT GAGTATACCT ATCCCTCCAG GGGTCATCAG 
CCACCATATT TAATCAATAA G C AG GAAGAC ATAAGCTAGC 
TTCTTTCCTC CCTGCTCCTT TCTCTTCTCT TCCCCCTCTC 
CATC CATC AG TATTTTCAGA GCATCTATTA TGTGTCAGGC 
TCAAACGGAG GAAAACAAGA ATAAACAAGA CAAAGATCTG 
AATCCCTATG GCTACTGTAG ACTTTTGAGC CATAAAGGAA 
TAGTGTAAAT GAAAATTCCT TAATGCTGTG CCTT T TAAAA 
CATAAGCA\A ATGATTAGTTTCTTT CTTTA AT AATGAGTC 
GAGAGTGTTT TGGGATCTAT TATTAAC TCT TCTTTCCTTT 
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